Methods and nucleic acids for the improved detection of colon 
cell proliferative disorders 

Introduction 

In the United States the annual incidence of colorectal cancer 
is approximately 150,000, with 56,600 individuals dying from 
colorectal cancer each year. The lifetime risk of colorectal 
cancer in the general population is about 5 to 6 percent. 
Despite intensive efforts in recent years in screening and 
early detection of colon cancer, most cases are still 
diagnosed at an advanced stage with regional or distant 
metastasis. While the therapeutic options include surgery and 
adjuvant or palliative chemotherapy, most patients die from 
progression of their cancer within a few months. Identifying 
the molecular changes that underlie the development of colon 
cancer may help to develop new monitoring, screening, 
diagnostic and therapeutic options that could improve the 
overall poor prognosis of these patients. 

The current model of colorectal pathogenesis favours a 
stepwise progression of adenomas which includes the development 
of dysplasia and finally signs of invasive cancer. The 
molecular changes underlying this adenoma-carcinoma sequence 
include genetic and epigenetic alterations of tumor suppressor 
genes (APC, p53, DCC), the activation of oncogenes (K-ras) and 
the inactivation of DNA mismatch repair genes. Recently, 
further molecular changes and genetic defects have been 
revealed. Thus, activation of the Wnt signalling pathway not 
only includes mutations of the APC gene, but may also result 
from β-catenin mutations. Furthermore, alterations in the TGF-β 
signalling pathway together with its signal transducers SMAD4 
and SMAD2 have been linked to the development of colon cancer. 

Despite recent progress in the understanding of the 
pathogenesis of adenomas and carcinomas of the colon and their 
genetic and molecular changes, the genetic and epigenetic 
changes underlying the development of metastasis are less well 
understood. It is, however, generally well accepted that the 
process of invasion and proteolysis of the extracellular 
matrix, as well as infiltration of the vascular basement 
membrane involve adhesive proteins, such as members of the 
family of integrin receptors, the cadherins, the immunoglobulin 
superfamily, the laminin binding protein and the CD44 receptor. 
Apart from adhesion, the process of metastasis formation also 
includes the induction and regulation of angiogenesis (VEGF, 
bFGF), the induction of cell proliferation (EGF, HGF, IGF) and 
the activation of proteolytic enzymes (MMPs, TIMPs, uPAR), as 
well as the inhibition of apoptosis (Bcl-2, Bcl-X). More 
recently, other groups have compared the genetic and molecular 
changes in metastatic lesions to the changes found in primary 
colorectal cancers. Thus, Kleeff et al. reported the loss of 
DOC-2, a candidate tumor suppressor gene, both in primary and 
metastatic colorectal cancer. Furthermore, Zauber et al. 
reported that in their series of 42 colorectal cancers Ki-ras 
mutations in the primary cancers were identical in all of the 
42 paired primary and synchronous metastatic lesions. Similarly, 
loss of heterozygosity at the APC locus was identical for 39 
paired carcinomas and synchronous metastases. The authors 
concluded that for Ki-ras and APC genes the genetic changes in 
metastasis are identical to the primary colorectal cancer. 
However, other groups have found genetic and molecular changes 
in metastatic colon cancers that are not present in the 
primary cancers. Thus, the development of LOH of chromosome 3p 
in colorectal metastasis has been reported. In addition, using 
comparative genomic hybridization, several alterations were 
found in liver metastasis that were unique to metastatic 
lesions (-9q, -11q, and -17q). 

Apart from mutations, aberrant methylation of CpG islands 
has been shown to lead to the transcriptional silencing of 
certain genes that have been previously linked to the 
pathogenesis of various cancers. CpG islands are short 
sequences which are rich in CpG dinucleotides and can usually 
be found in the 5' region of approximately 50% of all human 
genes. Methylation of the cytosines in these islands leads to 
the loss of gene expression and has been reported in the 
inactivation of the X chromosome and genomic imprinting. 

Recently, several groups have also analysed the methylation 
of various genes in colorectal cancer and reported the 
transcriptional silencing by promoter methylation for p16INK4, 
p14ARF, p15INK4b, MGMT, hMLH1, GSTP1, DAPK, CDH1, TIMP-3 and 
APC, among others. Thus, apart from mutational inactivation of 
certain genes, the hypermethylation of these genes also 
contributes significantly to the pathogenesis of this disease. 
In recent years several genes that are methylated in colon 
cancer have been identified by MS.AP-PCR. This group of genes, 
among others, includes TPEF/HPP1, which is frequently methylated 
in colon cancers and which was independently identified by two 
different groups using the MS.AP-PCR method. See, for example, 
Young J, Biden KG, Simms LA, Huggard P, Karamatic R, Eyre HJ, 
Sutherland GR, Herath N, Barker M, Anderson GJ, Fitzpatrick DR, 
Ramm GA, Jass JR, Leggett BA. HPP1: a transmembrane 
protein-encoding gene commonly methylated in colorectal polyps 
and cancers. Proc Natl Acad Sci USA 2001;98:265-270. 

Multifactorial approach. Cancer diagnostics has traditionally 
relied upon the detection of single molecular markers (e.g. 
gene mutations, elevated PSA levels). Unfortunately, cancer is 
a disease state in which single markers have typically failed 
to detect or differentiate many forms of the disease. Thus, 
assays that recognize only a single marker have been shown to 
be of limited predictive value. A fundamental aspect of this 
invention is that methylation-based cancer diagnostics, and the 
screening, diagnosis and therapeutic monitoring of such 
diseases, will provide significant improvements over the state 
of the art, which uses single-marker analyses, by the use of a 
selection of multiple markers. The multiplexed analytical 
approach is particularly well suited for cancer diagnostics, 
since cancer is not a simple disease; this multi-factorial 
"panel" approach is consistent with the heterogeneous nature of 
cancer, both cytologically and clinically. 

Key to the successful implementation of a panel approach to 
methylation based diagnostic tests is the design and 
development of optimized panels of markers that can 
characterize and distinguish disease states. This patent 
application describes an efficient and unique panel of genes; 
methylation analysis of one or a combination of the members 
of the panel enables the detection of cell proliferative 
disorders of the colon with a particularly high sensitivity, 
specificity and/or predictive value. 

Development of medical tests. Two key evaluative measures of 
any medical screening or diagnostic test are its sensitivity 
and specificity, which measure how well the test performs to 
accurately detect all affected individuals without exception, 
and without falsely including individuals who do not have the 
target disease (predictive value). Historically, many 
diagnostic tests have been criticized due to poor sensitivity 
and specificity. 

A true positive (TP) result is where the test is positive and 
the condition is present. A false positive (FP) result is where 
the test is positive but the condition is not present. A true 
negative (TN) result is where the test is negative and the 
condition is not present. A false negative (FN) result is where 
the test is negative but the condition is present. 

Sensitivity = TP/(TP+FN) 
Specificity = TN/(FP+TN) 
Predictive value = TP/(TP+FP) 
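The three ratios above can be computed directly from the four result counts. The following is a minimal sketch; the function name and the example counts are hypothetical and serve only to illustrate the arithmetic.

```python
def diagnostic_metrics(tp, fp, tn, fn):
    """Return (sensitivity, specificity, predictive value) from result counts."""
    sensitivity = tp / (tp + fn)       # affected individuals correctly detected
    specificity = tn / (fp + tn)       # unaffected individuals correctly excluded
    predictive_value = tp / (tp + fp)  # positive results that are true positives
    return sensitivity, specificity, predictive_value


# Hypothetical counts, not taken from any study.
sens, spec, ppv = diagnostic_metrics(tp=90, fp=20, tn=180, fn=10)
print(f"sensitivity={sens:.2f} specificity={spec:.2f} predictive value={ppv:.2f}")
```

With these assumed counts, 10 of 100 affected individuals are missed (false negatives) and 20 of 200 unaffected individuals are falsely flagged (false positives).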

Sensitivity is a measure of a test's ability to correctly 
detect the target disease in an individual being tested. A test 
having poor sensitivity produces a high rate of false 
negatives, i.e., individuals who have the disease but are 
falsely identified as being free of that particular disease. 
The potential danger of a false negative is that the diseased 
individual will remain undiagnosed and untreated for some 
period of time, during which the disease may progress to a 
later stage wherein treatments, if any, may be less effective. 
An example of a test that has low sensitivity is a protein- 
based blood test for HIV. This type of test exhibits poor 
sensitivity because it fails to detect the presence of the 
virus until the disease is well established and the virus has 
invaded the bloodstream in substantial numbers. In contrast, an 
example of a test that has high sensitivity is viral-load 
detection using the polymerase chain reaction (PCR). High 
sensitivity is achieved because this type of test can detect 
very small quantities of the virus. High sensitivity is 
particularly important when the consequences of missing a 
diagnosis are high. 

Specificity, on the other hand, is a measure of a test's 
ability to identify accurately patients who are free of the 
disease state. A test having poor specificity produces a high 
rate of false positives, i.e., individuals who are falsely 
identified as having the disease. A drawback of false positives 
is that they force patients to undergo unnecessary medical 
procedures and treatments, with their attendant risks and 
emotional and financial stresses, which could have adverse 
effects on the patient's health. A feature of diseases which makes it 
difficult to develop diagnostic tests with high specificity is 
that disease mechanisms, particularly in cancer, often involve 
a plurality of genes and proteins. Additionally, certain 
proteins may be elevated for reasons unrelated to a disease 
state. An example of a test that has high specificity is a 
gene-based test that can detect a p53 mutation. Specificity is 
important when the cost or risk associated with further 
diagnostic procedures or further medical intervention is very 
high. 

Bisulfite modification of DNA is an art-recognized tool used 
to assess CpG methylation status. 5-methylcytosine is the most 
frequent covalent base modification in the DNA of eukaryotic 
cells. It plays a role, for example, in the regulation of 
transcription, in genetic imprinting, and in tumorigenesis. 
Therefore, the identification of 5-methylcytosine as a 
component of genetic information is of considerable interest. 
However, 5-methylcytosine positions cannot be identified by 
sequencing, because 5-methylcytosine has the same base pairing 
behavior as cytosine. Moreover, the epigenetic information 
carried by 5-methylcytosine is completely lost during, e.g., 
PCR amplification. 

The most frequently used method for analyzing DNA for the 
presence of 5-methylcytosine is based upon the specific 
reaction of bisulfite with cytosine whereby, upon subsequent 
alkaline hydrolysis, cytosine is converted to uracil which 
corresponds to thymine in its base pairing behavior. 
Significantly, however, 5-methylcytosine remains unmodified 
under these conditions. Consequently, the original DNA is 
converted in such a manner that methylcytosine, which 
originally could not be distinguished from cytosine by its 
hybridization behavior, can now be detected as the only 
remaining cytosine using standard, art-recognized molecular 
biological techniques, for example, by amplification and 
hybridization, or by sequencing. All of these techniques are 
based on differential base pairing properties, which can now be 
fully exploited. 
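The effect of the conversion on a sequence read can be illustrated in silico. The sketch below is not a laboratory protocol; the function name, the example fragment and the choice of methylated position are assumptions made for illustration. Unmethylated cytosine is written as T, because the uracil it becomes pairs like thymine.

```python
def bisulfite_convert(seq, methylated_positions=()):
    """Return the sequence as read after bisulfite treatment and amplification.

    Cytosines whose 0-based index is in methylated_positions are treated as
    5-methylcytosine and stay C; every other cytosine is written as T.
    """
    methylated = set(methylated_positions)
    converted = []
    for index, base in enumerate(seq.upper()):
        if base == "C" and index not in methylated:
            converted.append("T")   # unmethylated C -> U, read as T
        else:
            converted.append(base)  # 5-methylcytosine and other bases unchanged
    return "".join(converted)


# Hypothetical fragment; the cytosine at index 2 (part of a CpG) is assumed methylated.
fragment = "ACCGTCGA"
print(bisulfite_convert(fragment, methylated_positions=[2]))  # methylated at index 2
print(bisulfite_convert(fragment))                            # fully unmethylated
```

After conversion, any cytosine remaining in the read marks a position that was methylated in the original DNA.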

The prior art, in terms of sensitivity, is defined by a method 
comprising enclosing the DNA to be analyzed in an agarose 
matrix, thereby preventing the diffusion and renaturation of 
the DNA (bisulfite only reacts with single-stranded DNA), and 
replacing all precipitation and purification steps with fast 
dialysis (Olek A, et al., A modified and improved method for 
bisulfite based cytosine methylation analysis, Nucleic Acids 
Res. 24:5064-6, 1996). It is thus possible to analyze 
individual cells for methylation status, illustrating the 
utility and sensitivity of the method. An overview of 
art-recognized methods for detecting 5-methylcytosine is provided 
by Rein, T., et al., Nucleic Acids Res., 26:2255, 1998. 

The bisulfite technique, barring few exceptions (e.g., 
Zeschnigk M, et al., Eur J Hum Genet. 5:94-98, 1997), is 
currently only used in research. In all instances, short, 
specific fragments of a known gene are amplified subsequent to 
a bisulfite treatment, and either completely sequenced (Olek & 
Walter, Nat Genet. 17:275-6, 1997), subjected to one or 
more primer extension reactions (Gonzalgo & Jones, Nucleic 
Acids Res., 25:2529-31, 1997; WO 95/00669; U.S. Patent No. 
6,251,594) to analyze individual cytosine positions, or treated 
by enzymatic digestion (Xiong & Laird, Nucleic Acids Res., 
25:2532-4, 1997). Detection by hybridization has also been 
described in the art (Olek et al., WO 99/28498). Additionally, 
use of the bisulfite technique for methylation detection with 
respect to individual genes has been described (Grigg & Clark, 
Bioessays, 16:431-6, 1994; Zeschnigk M, et al., Hum Mol Genet., 
6:387-95, 1997; Feil R, et al., Nucleic Acids Res., 22:695-, 
1994; Martin V, et al., Gene, 157:261-4, 1995; WO 9746705 and 
WO 9515373). 

Bisulfite Methylation Assay Procedures. Various methylation 
assay procedures are known in the art, and can be used in 
conjunction with the present invention. These assays allow for 
determination of the methylation state of one or a plurality of 
CpG dinucleotides (e.g., CpG islands) within a DNA sequence. 
Such assays involve, among other techniques, DNA sequencing of 
bisulfite-treated DNA, PCR (for sequence-specific 
amplification), Southern blot analysis, and use of 
methylation-sensitive restriction enzymes. 

For example, genomic sequencing has been simplified for 
analysis of DNA methylation patterns and 5-methylcytosine 
distribution by using bisulfite treatment (Frommer et al., 
Proc. Natl. Acad. Sci. USA 89:1827-1831, 1992). Additionally, 
restriction enzyme digestion of PCR products amplified from 
bisulfite-converted DNA is used, e.g., the method described by 
Sadri & Hornsby (Nucl. Acids Res. 24:5058-5059, 1996), or COBRA 
(Combined Bisulfite Restriction Analysis) (Xiong & Laird, 
Nucleic Acids Res. 25:2532-2534, 1997). 

COBRA. COBRA analysis is a quantitative methylation assay 
useful for determining DNA methylation levels at specific gene 
loci in small amounts of genomic DNA (Xiong & Laird, Nucleic 
Acids Res. 25:2532-2534, 1997). Briefly, restriction enzyme 
digestion is used to reveal methylation-dependent sequence 
differences in PCR products of sodium bisulfite-treated DNA. 
Methylation-dependent sequence differences are first introduced 
into the genomic DNA by standard bisulfite treatment according 
to the procedure described by Frommer et al. (Proc. Natl. Acad. 
Sci. USA 89:1827-1831, 1992). PCR amplification of the 
bisulfite converted DNA is then performed using primers 
specific for the CpG islands of interest, followed by 
restriction endonuclease digestion, gel electrophoresis, and 
detection using specific, labeled hybridization probes. 
Methylation levels in the original DNA sample are represented 
by the relative amounts of digested and undigested PCR product 
in a linearly quantitative fashion across a wide spectrum of 
DNA methylation levels. In addition, this technique can be 
reliably applied to DNA obtained from microdissected paraffin- 
embedded tissue samples. 
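The quantitation step can be sketched as a simple ratio of band signals. This is an illustration only. It assumes a restriction enzyme whose recognition site survives bisulfite conversion only in methylated DNA, so that the digested product represents methylated template; the function name and the signal values are hypothetical.

```python
def cobra_methylation_fraction(digested_signal, undigested_signal):
    """Estimate the methylated fraction from COBRA band intensities.

    Assumes the digested band derives from methylated template and the
    undigested band from unmethylated template.
    """
    total = digested_signal + undigested_signal
    if total <= 0:
        raise ValueError("total band signal must be positive")
    return digested_signal / total


# Hypothetical band intensities in arbitrary units.
fraction = cobra_methylation_fraction(digested_signal=30.0, undigested_signal=70.0)
print(f"estimated methylation: {fraction:.0%}")
```

If the chosen enzyme instead cuts only the product of unmethylated DNA, the two arguments swap roles.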

Other assays used in the art include "MethyLight" (a 
fluorescence-based real-time PCR technique) (Eads et al., 
Cancer Res. 59:2302-2306, 1999), Ms-SNuPE 
(Methylation-sensitive Single Nucleotide Primer Extension) reactions 
(Gonzalgo & Jones, Nucleic Acids Res. 25:2529-2531, 1997), 
methylation-specific PCR ("MSP"; Herman et al., Proc. Natl. 
Acad. Sci. USA 93:9821-9826, 1996; US Patent No. 5,786,146), 
and methylated CpG island amplification ("MCA"; Toyota et al., 
Cancer Res. 59:2307-12, 1999). These may be used alone or in 
combination with others of these methods. 

MethyLight. The MethyLight assay is a high-throughput 
quantitative methylation assay that utilizes fluorescence-based 
real-time PCR (TaqMan®) technology that requires no further 
manipulations after the PCR step (Eads et al., Cancer Res. 
59:2302-2306, 1999). Briefly, the MethyLight process begins 
with a mixed sample of genomic DNA that is converted, in a 
sodium bisulfite reaction, to a mixed pool of 
methylation-dependent sequence differences according to standard 
procedures (the bisulfite process converts unmethylated cytosine 
residues to uracil). Fluorescence-based PCR is then performed 
either in an "unbiased" (with primers that do not overlap known 
CpG methylation sites) PCR reaction, or in a "biased" (with PCR 
primers that overlap known CpG dinucleotides) reaction. 
Sequence discrimination can occur either at the level of the 
amplification process or at the level of the fluorescence 
detection process, or both. 

The MethyLight assay may be used as a quantitative test 
for methylation patterns in the genomic DNA sample, wherein 
sequence discrimination occurs at the level of probe 
hybridization. In this quantitative version, the PCR reaction 
provides for unbiased amplification in the presence of a 
fluorescent probe that overlaps a particular putative 
methylation site. An unbiased control for the amount of input 
DNA is provided by a reaction in which neither the primers, nor 
the probe overlie any CpG dinucleotides. Alternatively, a 
qualitative test for genomic methylation is achieved by probing 
of the biased PCR pool with either control oligonucleotides 
that do not "cover" known methylation sites (a 
fluorescence-based version of the "MSP" technique), or with 
oligonucleotides covering potential methylation sites. 

The MethyLight process can be used with a "TaqMan®" probe 
in the amplification process. For example, double-stranded 
genomic DNA is treated with sodium bisulfite and subjected to 
one of two sets of PCR reactions using TaqMan® probes; e.g., 
with either biased primers and TaqMan® probe, or unbiased 
primers and TaqMan® probe. The TaqMan® probe is dual-labeled 
with fluorescent "reporter" and "quencher" molecules, and is 
designed to be specific for a relatively high GC content region 
so that it melts out at about 10°C higher temperature in the 
PCR cycle than the forward or reverse primers. This allows the 
TaqMan® probe to remain fully hybridized during the PCR 
annealing/extension step. As the Taq polymerase enzymatically 
synthesizes a new strand during PCR, it will eventually reach 
the annealed TaqMan® probe. The Taq polymerase 5' to 3' 
endonuclease activity will then displace the TaqMan® probe by 
digesting it to release the fluorescent reporter molecule for 
quantitative detection of its now unquenched signal using a 
real-time fluorescent detection system. 

Alternatively, the MethyLight process can be used with 
"LightCycler" probes. A LightCycler probe is a pair of 
single-stranded fluorescent-labeled oligonucleotides. The first 
oligonucleotide probe is labeled at its 3' end with a donor 
fluorophore dye and the second is labeled at its 5' end with an 
acceptor fluorophore dye. The free 3' hydroxyl group of the 
second probe is blocked with a phosphate group to prevent 
polymerase mediated extension. 

During the annealing step of real-time quantitative PCR, the 
PCR primers and the LightCycler probes hybridize to their 
specific target regions causing the donor dye to come into 
close proximity to the acceptor dye. When the donor dye is 
excited by light, energy is transferred by Fluorescence 
Resonance Energy Transfer (FRET) from the donor to the acceptor 
dye. The energy transfer causes the acceptor dye to emit 
fluorescence wherein the increase of measured fluorescence 
signal is directly proportional to the amount of target DNA. 

Typical reagents (e.g., as might be found in a typical 
MethyLight-based kit) for MethyLight analysis may include, but 
are not limited to: PCR primers for specific gene (or 
methylation-altered DNA sequence or CpG island); TaqMan® and/or 
LightCycler probes; optimized PCR buffers and deoxynucleotides; 
and Taq polymerase. 

Ms-SNuPE. The Ms-SNuPE technique is a quantitative method 
for assessing methylation differences at specific CpG sites 
based on bisulfite treatment of DNA, followed by 
single-nucleotide primer extension (Gonzalgo & Jones, Nucleic Acids 
Res. 25:2529-2531, 1997). Briefly, genomic DNA is reacted with 
sodium bisulfite to convert unmethylated cytosine to uracil 
while leaving 5-methylcytosine unchanged. Amplification of the 
desired target sequence is then performed using PCR primers 
specific for bisulfite-converted DNA, and the resulting product 
is isolated and used as a template for methylation analysis at 
the CpG site(s) of interest. Small amounts of DNA can be 
analyzed (e.g., microdissected pathology sections), and it 
avoids utilization of restriction enzymes for determining the 
methylation status at CpG sites. 

Typical reagents (e.g., as might be found in a typical 
Ms-SNuPE-based kit) for Ms-SNuPE analysis may include, but are 
not limited to: PCR primers for specific gene (or 
methylation-altered DNA sequence or CpG island); optimized PCR 
buffers and deoxynucleotides; gel extraction kit; positive 
control primers; Ms-SNuPE primers for specific gene; reaction 
buffer (for the Ms-SNuPE reaction); and radioactive nucleotides. 
Additionally, bisulfite conversion reagents may include: DNA 
denaturation buffer; sulfonation buffer; DNA recovery reagents 
or kit (e.g., precipitation, ultrafiltration, affinity column); 
desulfonation buffer; and DNA recovery components. 

MSP. MSP (methylation-specific PCR) allows for assessing the 
methylation status of virtually any group of CpG sites within a 
CpG island, independent of the use of methylation-sensitive 
restriction enzymes (Herman et al., Proc. Natl. Acad. Sci. USA 
93:9821-9826, 1996; US Patent No. 5,786,146). Briefly, DNA is 
modified by sodium bisulfite converting all unmethylated, but 
not methylated, cytosines to uracil, and subsequently amplified 
with primers specific for methylated versus unmethylated DNA. 
This technique has been described in United States Patent No. 
6,265,171 to Herman. The use of methylation status specific 
primers for the amplification of bisulfite treated DNA allows 
the differentiation between methylated and unmethylated nucleic 
acids. MSP primer pairs contain at least one primer which 
hybridizes to a bisulfite treated CpG dinucleotide. Therefore, 
the sequence of said primers comprises at least one CpG 
dinucleotide. MSP primers specific for non-methylated DNA 
contain a "T" at the 3' position of the C position in the CpG. 
MSP requires only small quantities of DNA, is sensitive to 0.1% 
methylated alleles of a given CpG island locus, and can be 
performed on DNA extracted from paraffin-embedded samples. 
Typical reagents (e.g., as might be found in a typical 
MSP-based kit) for MSP analysis may include, but are not limited 
to: methylated and unmethylated PCR primers for specific gene 
(or methylation-altered DNA sequence or CpG island), optimized 
PCR buffers and deoxynucleotides, and specific probes. 
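The difference between primers for methylated and unmethylated DNA can be sketched by converting a short target site under both assumptions. This is only an illustration: the function name and the target site are hypothetical, only the top strand is considered, and real primer design also weighs length, melting temperature and strand choice.

```python
def convert_template(seq, cpg_methylated):
    """Bisulfite-convert a top-strand sequence for MSP primer comparison.

    Cytosines in a CpG stay C only when cpg_methylated is True; all other
    cytosines are written as T, as after conversion and amplification.
    """
    seq = seq.upper()
    converted = []
    for i, base in enumerate(seq):
        in_cpg = base == "C" and i + 1 < len(seq) and seq[i + 1] == "G"
        if base == "C" and not (in_cpg and cpg_methylated):
            converted.append("T")
        else:
            converted.append(base)
    return "".join(converted)


# Hypothetical primer-binding site containing two CpG dinucleotides.
site = "GACGTCCGAT"
methylated_primer = convert_template(site, cpg_methylated=True)
unmethylated_primer = convert_template(site, cpg_methylated=False)
differing = [i for i, (m, u) in enumerate(zip(methylated_primer, unmethylated_primer)) if m != u]
print(methylated_primer, unmethylated_primer, differing)
```

The two sequences differ only at the CpG cytosines, where the sequence for non-methylated DNA carries a T in place of the C.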

Pronounced need in the art. Therefore, in view of the 
incidence of colon cancer there is a substantial need in the 
art for the development of molecular markers that could be used 
for the early detection of colon cell proliferative disorders, 
in particular colon cancer. Additionally, there is a 
pronounced need in the art for the development of molecular 
markers that could be used to provide sensitive, accurate and 
non-invasive methods (as opposed to, e.g., biopsy) for the 
diagnosis, prognosis and treatment of colon cell proliferative 
disorders. 

DETAILED DESCRIPTION OF THE INVENTION 

For the purposes of the following invention, sensitivity and 
specificity refer to values calculated by reference to a sample 
set according to that described in the examples contained 
herein. 

Definitions: 

The term "Observed/Expected Ratio" ("O/E Ratio") refers to the 
frequency of CpG dinucleotides within a particular DNA 
sequence, and corresponds to the [number of CpG sites / (number 
of C bases × number of G bases)] × band length for each 
fragment. 

The term "CpG island" refers to a contiguous region of genomic 
DNA that satisfies the criteria of (1) having a frequency of 
CpG dinucleotides corresponding to an "Observed/Expected Ratio" 
>0.6, and (2) having a "GC Content" >0.5. CpG islands are 
typically, but not always, between about 0.2 to about 1 kb in 
length. 
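The two criteria can be checked for a candidate sequence as follows. This is a minimal sketch that takes "band length" to mean the length of the fragment in bases, which is an assumption; the function names and the example sequence are hypothetical.

```python
def cpg_island_metrics(seq):
    """Return (O/E ratio, GC content) for a DNA fragment."""
    seq = seq.upper()
    length = len(seq)
    c_count = seq.count("C")
    g_count = seq.count("G")
    cpg_count = seq.count("CG")  # number of CpG dinucleotides
    if c_count == 0 or g_count == 0:
        oe_ratio = 0.0
    else:
        oe_ratio = cpg_count / (c_count * g_count) * length
    gc_content = (c_count + g_count) / length if length else 0.0
    return oe_ratio, gc_content


def meets_cpg_island_criteria(seq):
    """Apply the two thresholds from the definition: O/E > 0.6 and GC > 0.5."""
    oe_ratio, gc_content = cpg_island_metrics(seq)
    return oe_ratio > 0.6 and gc_content > 0.5


# Hypothetical short fragment; real CpG islands are typically 0.2 to 1 kb long.
print(cpg_island_metrics("CGCGCGATCG"))
print(meets_cpg_island_criteria("CGCGCGATCG"))
```

The sketch tests the two ratio thresholds only and does not enforce any minimum length.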

The term "methylation state" or "methylation status" refers to 
the presence or absence of 5-methylcytosine ("5-mCyt") at one 
or a plurality of CpG dinucleotides within a DNA sequence. 
Methylation states at one or more particular palindromic CpG 
methylation sites (each having two CpG dinucleotide 
sequences) within a DNA sequence include "unmethylated," 
"fully-methylated" and "hemi-methylated." 

The term "hemi-methylation" or "hemimethylation" refers to the 
methylation state of a palindromic CpG methylation site, where 
only a single cytosine in one of the two CpG dinucleotide 
sequences of the palindromic CpG methylation site is methylated 
(e.g., 5'-CCᴹGG-3' (top strand): 3'-GGCC-5' (bottom strand)). 

The term "hypermethylation" refers to the average methylation 
state corresponding to an increased presence of 5-mCyt at one 
or a plurality of CpG dinucleotides within a DNA sequence of a 
test DNA sample, relative to the amount of 5-mCyt found at 
corresponding CpG dinucleotides within a normal control DNA 
sample. 

The term "hypomethylation" refers to the average methylation 
state corresponding to a decreased presence of 5-mCyt at one or 
a plurality of CpG dinucleotides within a DNA sequence of a 
test DNA sample, relative to the amount of 5-mCyt found at 
corresponding CpG dinucleotides within a normal control DNA 
sample. 

The term "microarray" refers broadly to both "DNA microarrays," 
and "DNA chip(s)," as recognized in the art, encompasses all 
art-recognized solid supports, and encompasses all methods for 
affixing nucleic acid molecules thereto or synthesis of nucleic 
acids thereon. 

"Genetic parameters" are mutations and polymorphisms of genes 
and sequences further required for their regulation. Mutations 
include, in particular, insertions, 
deletions, point mutations, inversions and polymorphisms and, 
particularly preferred, SNPs (single nucleotide polymorphisms). 
"Epigenetic parameters" are, in particular, cytosine 
methylations. Further epigenetic parameters include, for 
example, the acetylation of histones which, however, cannot be 
directly analyzed using the described method but which, in 
turn, correlate with the DNA methylation. 

The term "bisulfite reagent" refers to a reagent comprising 
bisulfite, disulfite, hydrogen sulfite or combinations thereof, 
useful as disclosed herein to distinguish between methylated 
and unmethylated CpG dinucleotide sequences. 

The term "Methylation assay" refers to any assay for 
determining the methylation state of one or more CpG 
dinucleotide sequences within a sequence of DNA. 

The term "MS.AP-PCR" (Methylation-Sensitive Arbitrarily-Primed 
Polymerase Chain Reaction) refers to the art-recognized 
technology that allows for a global scan of the genome using 
CG-rich primers to focus on the regions most likely to contain 
CpG dinucleotides, and described by Gonzalgo et al., Cancer 
Research 57:594-599, 1997. 

The term "MethyLight™" refers to the art-recognized 
fluorescence-based real-time PCR technique described by Eads et 
al., Cancer Res. 59:2302-2306, 1999. 

The term "HeavyMethyl" assay, in the embodiment thereof 
implemented herein, refers to an assay comprising the use of 
methylation specific blocking probes covering CpG positions 
between the amplification primers. 

The term "Ms-SNuPE" (Methylation-sensitive Single Nucleotide 
Primer Extension) refers to the art-recognized assay described 
by Gonzalgo & Jones, Nucleic Acids Res. 25:2529-2531, 1997. 

The term "MSP" (Methylation-specific PCR) refers to the 
art-recognized methylation assay described by Herman et al., Proc. 
Natl. Acad. Sci. USA 93:9821-9826, 1996, and by US Patent No. 
5,786,146. 

The term "COBRA" (Combined Bisulfite Restriction Analysis) 
refers to the art-recognized methylation assay described by 
Xiong & Laird, Nucleic Acids Res. 25:2532-2534, 1997. 

The term "MCA" (Methylated CpG Island Amplification) refers to 
the methylation assay described by Toyota et al., Cancer Res. 
59:2307-12, 1999, and in WO 00/26401A1. 

The term "hybridization" is to be understood as a bond of an 
oligonucleotide to a complementary sequence along the lines of 
the Watson-Crick base pairings in the sample DNA, forming a 
duplex structure. 

"Stringent hybridization conditions," as defined herein, 
involve hybridizing at 68°C in 5x SSC/5x Denhardt's 
solution/1.0% SDS, and washing in 0.2x SSC/0.1% SDS at room 
temperature, or involve the art-recognized equivalent thereof 
(e.g., conditions in which a hybridization is carried out at 
60°C in 2.5x SSC buffer, followed by several washing steps at 
37°C in a low buffer concentration, and remains stable). 
Moderately stringent conditions, as defined herein, involve 
washing in 3x SSC at 42°C, or the art-recognized 
equivalent thereof. The parameters of salt concentration and 
temperature can be varied to achieve the optimal level of 
identity between the probe and the target nucleic acid. 
Guidance regarding such conditions is available in the art, for 
example, by Sambrook et al., 1989, Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Press, N.Y.; and Ausubel 
et al. (eds.), 1995, Current Protocols in Molecular Biology, 
(John Wiley & Sons, N.Y.) at Unit 2.10. 

Overview: 

Despite intensive efforts to improve screening and early 
detection of colon cell proliferative disorders, most cases are 
diagnosed in an advanced stage with regional or distant 
metastasis, which is associated with poor survival. The 
invention described herein discloses epigenetic markers that have 
novel utility for the analysis of colon cell proliferative 
disorders combined with sensitive assay methods for the 
improved detection, classification, treatment and overall 
prognosis of said disorders. The invention presents 
improvements over the state of the art in that it provides a 
means for the detection, classification and prognosis of colon 
cell proliferative disorders by analysis of a gene panel, with 
a high sensitivity and specificity. The invention presents 
further improvements in that the 'gene panel' consists of only 
15 genes and/or their regulatory sequences, thereby enabling 
a highly sensitive and specific but time and cost effective 
analysis of a limited number of genes and/or their regulatory 
sequences. 

In one aspect, the present invention provides for the use of 
the bisulfite technique, in combination with one or more 
methylation assays, for determination of the methylation status 
of CpG dinucleotide sequences within sequences from the group 
consisting of SEQ ID NO: 1 to SEQ ID NO: 15. According to the 
present invention, determination of the methylation status of 
CpG dinucleotide sequences within sequences from the group 
consisting of SEQ ID NO: 1 to SEQ ID NO: 15 has diagnostic and 
prognostic utility. 

GENOMIC SEQUENCES ACCORDING TO SEQ ID NO: 1 to SEQ ID NO: 15, 
AND TREATED VARIANTS THEREOF ACCORDING TO SEQ ID NO: 16 to SEQ 
ID NO: 75, WERE DETERMINED TO HAVE UTILITY FOR THE DETECTION, 
CLASSIFICATION AND/OR TREATMENT OF COLON CELL PROLIFERATIVE 
DISORDERS. 

In one aspect the present invention provides a selection 
of genetic sequences according to Table 1. One or more of these 
genes are analysed in the form of a 'gene panel'. This aspect of 
the invention is further based upon the analysis of methylation 
levels within one or more genomic sequences taken from the 
group consisting of SEQ ID NO: 1 to SEQ ID NO: 15. 

Particular embodiments of the present invention provide a 
novel application of the analysis of methylation levels and/or 
patterns within said sequences that enables a precise 
detection, classification, treatment and overall prognosis of 
colon cell proliferative disorders. Early detection of colon 
cell proliferative disorders is directly linked with disease 
prognosis, and the disclosed method thereby enables the 
physician and patient to make better and more informed 
treatment decisions. 

Detailed description 

The present invention is based upon the analysis of 
methylation levels within one or more genomic sequences taken 
from the group according to Table 1 and/or their regulatory 
sequences. Accordingly, the invention also discloses the 
genomic sequences of said genes in SEQ ID NO: 1 to SEQ ID NO: 
15, according to Table 1. Additional embodiments provide 
modified variants of SEQ ID NO: 1 to SEQ ID NO: 15, as well as 
oligonucleotides and/or PNA-oligomers for analysis of cytosine 
methylation patterns within SEQ ID NO: 1 to SEQ ID NO: 15. 

According to the present invention, hypermethylation of the 
genomic sequences according to Table 1 and/or their regulatory 
sequences is correlated with the presence of colon cell 
proliferative disorders. The present invention discloses the 
analysis of methylation within said genes and/or their 
regulatory sequences in the form of a panel enabling the 
improved detection, classification, treatment and overall 
prognosis of colon cell proliferative disorders. 

Hypermethylation of one or more of the genomic 
sequences according to Table 1 and/or their regulatory 
sequences is indicative of the presence of colon carcinoma. 
Early detection of colon carcinoma increases the chances of 
patient survival. 

An objective of the invention comprises analysis of the 
methylation state of one or more CpG dinucleotides within at 
least one of the genomic sequences selected from the group 
consisting of SEQ ID NO: 1 to SEQ ID NO: 15 and sequences 
complementary thereto. 

Where the object of the analysis is the detection of 
colon cell proliferative disorders, it is preferred that the 
methylation of one or more sequences selected from the group 
consisting of SEQ ID NO: 1 to SEQ ID NO: 15 is analysed. 

The disclosed invention provides treated nucleic acids, 
derived from genomic SEQ ID NO: 1 to SEQ ID NO: 15, wherein the 
treatment is suitable to convert at least one unmethylated 
cytosine base of the genomic DNA sequence to uracil or another 
base that is detectably dissimilar to cytosine in terms of 
hybridization. The genomic sequences in question may comprise 
one, or more, consecutive or random methylated CpG positions. 
Said treatment preferably comprises use of a reagent selected 
from the group consisting of bisulfite, hydrogen sulfite, 
disulfite, and combinations thereof. In a preferred embodiment 
of the invention, the objective comprises analysis of a 
modified nucleic acid comprising a sequence of at least 16 
contiguous nucleotide bases in length of a sequence selected 
from the group consisting of SEQ ID NO: 16 to SEQ ID NO: 75, 
wherein said sequence comprises at least one CpG, TpA or CpA 
dinucleotide and sequences complementary thereto. The sequences 
of SEQ ID NO: 16 to SEQ ID NO: 75 provide modified versions of 
the nucleic acid according to SEQ ID NO: 1 to SEQ ID NO: 15, 
wherein the modification of each genomic sequence results in 
the synthesis of a nucleic acid having a sequence that is 
unique and distinct from said genomic sequence as follows. For 
each sense strand genomic DNA, e.g., SEQ ID NO:1, four 
converted versions are disclosed. A first version wherein 
"C" is converted to "T," but "CpG" remains "CpG" (i.e., 
corresponds to case where, for the genomic sequence, all "C" 
residues of CpG dinucleotide sequences are methylated and are 
thus not converted); a second version discloses the complement 
of the disclosed genomic DNA sequence (i.e. antisense strand), 
wherein "C" is converted to "T," but "CpG" remains "CpG" (i.e., 
corresponds to case where all "C" residues of CpG 
dinucleotide sequences are methylated and are thus not 
converted). The 'upmethylated' converted sequences of SEQ ID 
NO: 1 to SEQ ID NO: 15 correspond to SEQ ID NO: 16 to SEQ ID 
NO: 45. A third chemically converted version of each genomic 
sequence is provided, wherein "C" is converted to "T" for all 
"C" residues, including those of "CpG" dinucleotide sequences 
(i.e., corresponds to case where, for the genomic sequences, 
all "C" residues of CpG dinucleotide sequences are 
unmethylated); a final chemically converted version of each 
sequence discloses the complement of the disclosed genomic DNA 
sequence (i.e. antisense strand), wherein "C" is converted 
to "T" for all "C" residues, including those of "CpG" 
dinucleotide sequences (i.e., corresponds to case where, for 
the complement (antisense strand) of each genomic sequence, all 
"C" residues of CpG dinucleotide sequences are unmethylated). 
The 'downmethylated' converted sequences of SEQ ID NO: 1 to SEQ 
ID NO: 15 correspond to SEQ ID NO: 46 to SEQ ID NO: 75. 
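
The four converted versions described above can be illustrated 
with a minimal sketch. It is not the procedure used to generate 
the Sequence Listing; the function names and the toy input 
sequence are hypothetical, and the antisense strand is assumed 
to be written 5' to 3' as the reverse complement of the sense 
strand. 

```python
# Illustrative sketch only: in-silico bisulfite conversion of a toy sequence.
# The function names and the example sequence are assumptions, not part of
# the disclosed Sequence Listing.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the antisense strand, written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def convert(seq: str, cpg_methylated: bool) -> str:
    """Convert C to T; keep the C of each CpG only if CpGs are methylated."""
    seq = seq.upper()
    out = []
    for i, base in enumerate(seq):
        in_cpg = base == "C" and i + 1 < len(seq) and seq[i + 1] == "G"
        if base == "C" and not (cpg_methylated and in_cpg):
            out.append("T")
        else:
            out.append(base)
    return "".join(out)


def four_converted_versions(genomic: str) -> dict:
    """Return the four converted versions described in the text."""
    antisense = reverse_complement(genomic)
    return {
        "sense_methylated": convert(genomic, True),
        "antisense_methylated": convert(antisense, True),
        "sense_unmethylated": convert(genomic, False),
        "antisense_unmethylated": convert(antisense, False),
    }


if __name__ == "__main__":
    for name, seq in four_converted_versions("ACGTCCGA").items():
        print(name, seq)
```

Because the two strands are no longer complementary after 
conversion, each genomic sequence yields four distinct treated 
sequences rather than two. 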

In a preferred embodiment, such analysis comprises the use 
of an oligonucleotide or oligomer for detecting the cytosine 
methylation state within genomic or pretreated DNA, according 
to SEQ ID NO: 1 to SEQ ID NO: 75. Said oligonucleotide or 
oligomer comprises a nucleic acid sequence having a length of 
at least nine (9) nucleotides which hybridizes, under 
moderately stringent or stringent conditions (as defined herein 
above), to a pretreated nucleic acid sequence according to SEQ 
ID NO: 16 to SEQ ID NO: 75 and/or sequences complementary 
thereto, or to a genomic sequence according to SEQ ID NO: 1 to 
SEQ ID NO: 15 and/or sequences complementary thereto. 

Thus, the present invention includes nucleic acid 
molecules (e.g., oligonucleotides and peptide nucleic acid 
(PNA) molecules (PNA-oligomers)) that hybridize under 
moderately stringent and/or stringent hybridization conditions 
to all or a portion of the sequences SEQ ID NO: 1 to SEQ ID NO: 
75, or to the complements thereof. The hybridizing portion of 
the hybridizing nucleic acids is typically at least 9, 15, 20, 
25, 30 or 35 nucleotides in length. However, longer molecules 
have inventive utility, and are thus within the scope of the 
present invention. 

Preferably, the hybridizing portion of the inventive 
hybridizing nucleic acids is at least 95%, or at least 98%, or 
100% identical to the sequence, or to a portion thereof of SEQ 
ID NO: 1 to SEQ ID NO: 75, or to the complements thereof. 

Hybridizing nucleic acids of the type described herein can 
be used, for example, as a primer (e.g., a PCR primer), or a 
diagnostic and/or prognostic probe or primer. Preferably, 
hybridization of the oligonucleotide probe to a nucleic acid 
sample is performed under stringent conditions and the probe is 
100% identical to the target sequence. Nucleic acid duplex or 
hybrid stability is expressed as the melting temperature or Tm, 
which is the temperature at which a probe dissociates from a 
target DNA. This melting temperature is used to define the 
required stringency conditions. 

For target sequences that are related and substantially 
identical to the corresponding sequence of SEQ ID NO: 1 to SEQ 
ID NO: 15 (such as allelic variants and SNPs), rather than 
identical, it is useful to first establish the lowest 
temperature at which only homologous hybridization occurs with 
a particular concentration of salt (e.g., SSC or SSPE). Then, 
assuming that 1% mismatching results in a 1°C decrease in the 
Tm, the temperature of the final wash in the hybridization 
reaction is reduced accordingly (for example, if sequences 
having > 95% identity with the probe are sought, the final wash 
temperature is decreased by 5°C). In practice, the change in 
Tm can be between 0.5°C and 1.5°C per 1% mismatch. 
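
The wash-temperature adjustment described above amounts to a 
simple subtraction, sketched below. The function name and the 
starting temperature of 65°C are hypothetical values chosen 
only for illustration; the 1°C per 1% mismatch default and the 
0.5°C to 1.5°C range are taken from the text. 

```python
# Illustrative sketch only: final wash temperature for a desired identity
# threshold. The function name and the 65 C starting value are assumptions.


def final_wash_temperature(homologous_temp_c: float,
                           min_identity_percent: float,
                           tm_drop_per_percent: float = 1.0) -> float:
    """Lower the wash temperature by the Tm drop expected for the mismatch."""
    if not 0.0 < min_identity_percent <= 100.0:
        raise ValueError("identity must be in the range (0, 100]")
    mismatch_percent = 100.0 - min_identity_percent
    return homologous_temp_c - mismatch_percent * tm_drop_per_percent


if __name__ == "__main__":
    # Sequences with > 95% identity sought, starting from a hypothetical 65 C.
    for drop in (0.5, 1.0, 1.5):
        print(drop, final_wash_temperature(65.0, 95.0, drop))
```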

Examples of inventive oligonucleotides of length X (in 
nucleotides), as indicated by polynucleotide positions with 
reference to, e.g., SEQ ID NO:1, include those corresponding to 
sets (sense and antisense sets) of consecutively overlapping 
oligonucleotides of length X, where the oligonucleotides within 
each consecutively overlapping set (corresponding to a given X 
value) are defined as the finite set of Z oligonucleotides from 
nucleotide positions: 

n to (n + (X-1)); 

where n = 1, 2, 3, ... (Y-(X-1)); 

where Y equals the length (nucleotides or base pairs); 

where X equals the common length (in nucleotides) of each 
oligonucleotide in the set (e.g., X=20 for a set of 
consecutively overlapping 20-mers); and where the number (Z) of 
consecutively overlapping oligomers of length X for a given SEQ 
ID NO of length Y is equal to Y-(X-1). 

Preferably, the set is limited to those oligomers that 
comprise at least one CpG, TpG or CpA dinucleotide. 
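
The definition of a consecutively overlapping set can be 
sketched as follows. The function names and the eight-base toy 
sequence are hypothetical; positions are 1-based to match the 
notation n to (n + (X-1)) used above. 

```python
# Illustrative sketch only: enumerate the Z = Y-(X-1) overlapping oligomers
# of length X for a sequence of length Y, then keep those that contain a
# CpG, TpG or CpA dinucleotide. Names and the toy sequence are assumptions.


def overlapping_oligomers(seq: str, x: int) -> list:
    """Return (n, oligomer) pairs for n = 1 .. Y-(X-1), 1-based positions."""
    y = len(seq)
    if x < 1 or x > y:
        raise ValueError("X must be between 1 and the sequence length Y")
    return [(n, seq[n - 1:n - 1 + x]) for n in range(1, y - (x - 1) + 1)]


def has_informative_dinucleotide(oligo: str) -> bool:
    """True if the oligomer contains at least one CpG, TpG or CpA."""
    return any(d in oligo.upper() for d in ("CG", "TG", "CA"))


if __name__ == "__main__":
    oligos = overlapping_oligomers("AATTCGAA", 4)
    print(len(oligos))  # Z = 8 - (4 - 1)
    print([o for _, o in oligos if has_informative_dinucleotide(o)])
```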



The present invention encompasses, for each of SEQ ID NO: 
1 to SEQ ID NO: 75 (sense and antisense), multiple 
consecutively overlapping sets of oligonucleotides or modified 
oligonucleotides of length X, where, e.g., X = 9, 10, 17, 20, 
22, 23, 25, 27, 30 or 35 nucleotides. 

The oligonucleotides or oligomers according to the present 
invention constitute effective tools useful to ascertain 
genetic and epigenetic parameters of the genomic sequence 
corresponding to SEQ ID NO: 1 to SEQ ID NO: 15. Preferred sets 
of such oligonucleotides or modified oligonucleotides of length 
X are those consecutively overlapping sets of oligomers 
corresponding to SEQ ID NO: 1 to SEQ ID NO: 75 (and to the 
complements thereof). Preferably, said oligomers comprise at 
least one CpG, TpG or CpA dinucleotide. 

Particularly preferred oligonucleotides or oligomers 
according to the present invention are those in which the 
cytosine of the CpG dinucleotide (or of the corresponding 
converted TpG or CpA dinucleotide) sequences is within the 
middle third of the oligonucleotide; that is, where the 
oligonucleotide is, for example, 13 bases in length, the CpG, 
TpG or CpA dinucleotide is positioned within the fifth to ninth 
nucleotide from the 5'-end. 
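
The "middle third" criterion can be checked with a short 
sketch. The rounding rule used here (floor of one third of the 
length on each flank) is an assumption chosen so that a 13-mer 
gives positions five to nine as in the example above; the 
function names are hypothetical, and the check requires both 
bases of the dinucleotide to lie inside the window. 

```python
# Illustrative sketch only: test whether a CpG, TpG or CpA dinucleotide lies
# within the middle third of an oligomer. The rounding convention and the
# function names are assumptions.


def middle_third(length: int) -> tuple:
    """Return the 1-based (first, last) positions of the middle third."""
    flank = length // 3
    return flank + 1, length - flank


def dinucleotide_in_middle_third(oligo: str) -> bool:
    """True if any CpG, TpG or CpA lies entirely within the middle third."""
    oligo = oligo.upper()
    first, last = middle_third(len(oligo))
    for dinuc in ("CG", "TG", "CA"):
        idx = oligo.find(dinuc)
        while idx != -1:
            pos = idx + 1  # 1-based position of the first base
            if first <= pos and pos + 1 <= last:
                return True
            idx = oligo.find(dinuc, idx + 1)
    return False


if __name__ == "__main__":
    print(middle_third(13))
    print(dinucleotide_in_middle_third("AAAAAACGAAAAA"))
```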

The oligonucleotides of the invention can also be modified 
by chemically linking the oligonucleotide to one or more 
moieties or conjugates to enhance the activity, stability or 
detection of the oligonucleotide. Such moieties or conjugates 
include chromophores, fluorophors, lipids such as cholesterol, 
cholic acid, thioether, aliphatic chains, phospholipids, 
polyamines, polyethylene glycol (PEG), palmityl moieties, and 
others as disclosed in, for example, United States Patent 
Numbers 5,514,758, 5,565,552, 5,567,810, 5,574,142, 5,585,481, 
5,587,371, 5,597,696 and 5,958,773. The probes may also exist 
in the form of a PNA (peptide nucleic acid) which has 
particularly preferred pairing properties. Thus, the 
oligonucleotide may include other appended groups such as 
peptides, and may include hybridization-triggered cleavage 
agents (Krol et al., BioTechniques 6:958-976, 1988) or 
intercalating agents (Zon, Pharm. Res. 5:539-549, 1988). To 
this end, the oligonucleotide may be conjugated to another 
molecule, e.g., a chromophore, fluorophor, peptide, 
hybridization-triggered cross-linking agent, transport agent, 
hybridization-triggered cleavage agent, etc. 

The oligonucleotide may also comprise at least one art- 
recognized modified sugar and/or base moiety, or may comprise a 
modified backbone or non-natural internucleoside linkage. 

The oligonucleotides or oligomers according to particular 
embodiments of the present invention are typically used in 
'sets,' which contain at least one oligomer for analysis of 
each of the CpG dinucleotides of genomic sequence SEQ ID NO: 1 
to SEQ ID NO: 15 and sequences complementary thereto, or to the 
corresponding CpG, TpG or CpA dinucleotide within a sequence of 
the pretreated nucleic acids according to SEQ ID NO: 16 to SEQ 
ID NO: 75 and sequences complementary thereto. However, it is 
anticipated that for economic or other factors it may be 
preferable to analyze a limited selection of the CpG 
dinucleotides within said sequences, and the content of the set 
of oligonucleotides is altered accordingly. 

Therefore, in particular embodiments, the present 
invention provides a set of at least two (2) oligonucleotides 
and/or PNA-oligomers useful for detecting the cytosine 
methylation state in pretreated genomic DNA (SEQ ID NO: 16 to 
SEQ ID NO: 75), or in genomic DNA (SEQ ID NO: 1 to SEQ ID NO: 
15 and sequences complementary thereto). These probes enable 
diagnosis, classification and/or therapy of genetic and 
epigenetic parameters of colon cell proliferative disorders. 
The set of oligomers may also be used for detecting single 
nucleotide polymorphisms (SNPs) in pretreated genomic DNA (SEQ 
ID NO: 16 to SEQ ID NO: 75), or in genomic DNA (SEQ ID NO: 1 to 
SEQ ID NO: 15 and sequences complementary thereto). 

In preferred embodiments, at least one, and more 
preferably all members of a set of oligonucleotides is bound to 
a solid phase. 

In further embodiments, the present invention provides a 
set of at least two (2) oligonucleotides that are used as 
'primer' oligonucleotides for amplifying DNA sequences of one 
of SEQ ID NO: 1 to SEQ ID NO: 75 and sequences complementary 
thereto, or segments thereof. 

It is anticipated that the oligonucleotides may constitute 
all or part of an "array" or "DNA chip" (i.e., an arrangement 
of different oligonucleotides and/or PNA-oligomers bound to a 
solid phase). Such an array of different oligonucleotide 
and/or PNA-oligomer sequences can be characterized, for 
example, in that it is arranged on the solid phase in the form 
of a rectangular or hexagonal lattice. The solid-phase surface 
may be composed of silicon, glass, polystyrene, aluminum, 
steel, iron, copper, nickel, silver, or gold. Nitrocellulose 
as well as plastics such as nylon, which can exist in the form 
of pellets or also as resin matrices, may also be used. An 
overview of the Prior Art in oligomer array manufacturing can 
be gathered from a special edition of Nature Genetics (Nature 
Genetics Supplement, Volume 21, January 1999), and from the 
literature cited therein. Fluorescently labeled probes are 
often used for the scanning of immobilized DNA arrays. The 
simple attachment of Cy3 and Cy5 dyes to the 5'-OH of the 
specific probe is particularly suitable for fluorescence 
labels. The detection of the fluorescence of the hybridized 
probes may be carried out, for example, via a confocal 
microscope. Cy3 and Cy5 dyes, besides many others, are 
commercially available. 

It is particularly preferred that the oligomers according to 
the invention are utilised for at least one of: detection 
of; detection and differentiation between or among 
subclasses of; diagnosis of; prognosis of; treatment of; 
monitoring of; and treatment and monitoring of colon cell 
proliferative disorders. This is enabled by use of said sets 
for the detection or detection and differentiation of one or 
more of the following classes of tissues: colorectal 
carcinoma, colon adenoma, inflammatory colon tissue, grade 
2 dysplasia colon adenomas less than 1 cm, grade 3 
dysplasia colon adenomas larger than 1 cm, normal colon 
tissue, non-colon healthy tissue and non-colon cancer 
tissue. 

The present invention further provides a method for 
ascertaining genetic and/or epigenetic parameters of the 
genomic sequences according to SEQ ID NO: 1 to SEQ ID NO: 15 
within a subject by analyzing cytosine methylation and single 
nucleotide polymorphisms. 

Where the object of the analysis is the detection or 
monitoring of colon cell proliferative disorders, it is 
preferred that the methylation of one or more genomic sequences 
selected from Table 1 and/or their regulatory sequences is 
analysed. In this embodiment of the invention it is 
particularly preferred that the methylation of one or more of 
the sequences of the group consisting of SEQ ID NO: 1 to SEQ ID 
NO: 15 and/or their regulatory sequences is analysed. 

In a preferred embodiment the method comprises contacting 
a nucleic acid comprising one or more of SEQ ID NO: 1 to SEQ ID 
NO: 15 in a biological sample obtained from said subject with 
at least one reagent or a series of reagents, wherein said 
reagent or series of reagents distinguishes between methylated 
and non-methylated CpG dinucleotides within the target nucleic 
acid. 

Preferably, said method comprises the following steps: In the 
first step, a sample of the tissue to be analysed is obtained. 
The source may be any suitable source, such as cell lines, 
histological slides, biopsies, tissue embedded in paraffin, 
bodily fluids, ejaculate, urine, blood and all possible 
combinations thereof. Genomic DNA is then isolated from said 
biological sample; this may be by any means standard in the 
art, including the use of commercially available kits. Briefly, 
where the DNA of interest is encapsulated by a cellular 
membrane, the biological sample must be disrupted and lysed by 
enzymatic, chemical or mechanical means. The DNA solution may 
then be cleared of proteins and other contaminants, e.g. by 
digestion with proteinase K. The genomic DNA is then recovered 
from the solution. This may be carried out by means of a 
variety of methods including salting out, organic extraction or 
binding of the DNA to a solid phase support. The choice of 
method will be affected by several factors including time, 
expense and required quantity of DNA. 

Once the nucleic acids have been extracted, the genomic double 
stranded DNA is used in the analysis. 

In the second step of the method, the genomic DNA sample 
is treated in such a manner that cytosine bases which are 
unmethylated at the 5'-position are converted to uracil, 
thymine, or another base which is dissimilar to cytosine in 
terms of hybridization behavior. This will be understood as 
'pretreatment' herein. 

The above described treatment of genomic DNA is preferably 
carried out with bisulfite (hydrogen sulfite, disulfite) and 
subsequent alkaline hydrolysis which results in a conversion of 
non-methylated cytosine nucleobases to uracil or to another 
base which is dissimilar to cytosine in terms of base pairing 
behavior. 

In the third step of the method, fragments of the 
pretreated DNA are amplified, using sets of primer 
oligonucleotides according to the present invention, and an 
amplification enzyme. The amplification of several DNA 
segments can be carried out simultaneously in one and the same 
reaction vessel. Typically, the amplification is carried out 
using a polymerase chain reaction (PCR). The set of primer 
oligonucleotides includes at least two oligonucleotides whose 
sequences are each reverse complementary, identical, or 
hybridize under stringent or highly stringent conditions to an 
at least 16-base-pair long segment of the base sequences of one 
or more of SEQ ID NO: 16 to SEQ ID NO: 75 and sequences 
complementary thereto. 

In an alternative embodiment of the method, the 
methylation status of preselected CpG positions within the 
nucleic acid sequences comprising one or more of SEQ ID NO: 1 
to SEQ ID NO: 15 may be detected by use of methylation-specific 
primer oligonucleotides. This technique (MSP) has been 
described in United States Patent No. 6,265,171 to Herman. The 
use of methylation status specific primers for the 
amplification of bisulfite treated DNA allows the 
differentiation between methylated and unmethylated nucleic 
acids. MSP primer pairs contain at least one primer which 
hybridizes to a bisulfite treated CpG dinucleotide. Therefore, 
the sequence of said primers comprises at least one CpG 
dinucleotide. MSP primers specific for non-methylated DNA 
contain a 'T' at the 3' position of the C position in the CpG. 
Preferably, therefore, the base sequence of said primers is 
required to comprise a sequence having a length of at least 9 
nucleotides which hybridizes to a pretreated nucleic acid 
sequence according to one of SEQ ID NO: 16 to SEQ ID NO: 75 and 
sequences complementary thereto, wherein the base sequence of 
said oligomers comprises at least one CpG dinucleotide. 

A further preferred embodiment of the method comprises the 
use of blocker oligonucleotides. The use of such blocker 
oligonucleotides has been described by Yu et al., BioTechniques 
23:714-720, 1997. Blocking probe oligonucleotides are 
hybridized to the bisulfite treated nucleic acid concurrently 
with the PCR primers. PCR amplification of the nucleic acid is 
terminated at the 5' position of the blocking probe, such that 
amplification of a nucleic acid is suppressed where the 
complementary sequence to the blocking probe is present. The 
probes may be designed to hybridize to the bisulfite treated 
nucleic acid in a methylation status specific manner. For 
example, for detection of methylated nucleic acids within a 
population of unmethylated nucleic acids, suppression of the 
amplification of nucleic acids which are unmethylated at the 
position in question would be carried out by the use of 
blocking probes comprising a 'CpA' or 'TpA' at the position in 
question, as opposed to a 'CpG' if the suppression of 
amplification of methylated nucleic acids is desired. 

For PCR methods using blocker oligonucleotides, efficient 
disruption of polymerase-mediated amplification requires that 
blocker oligonucleotides not be elongated by the polymerase. 
Preferably, this is achieved through the use of blockers that 
are 3'-deoxyoligonucleotides, or oligonucleotides derivatized 
at the 3' position with other than a "free" hydroxyl group. 
For example, 3'-O-acetyl oligonucleotides are representative of 
a preferred class of blocker molecule. 

Additionally, polymerase-mediated decomposition of the 
blocker oligonucleotides should be precluded. Preferably, such 
preclusion comprises either use of a polymerase lacking 5'-3' 
exonuclease activity, or use of modified blocker 
oligonucleotides having, for example, thioate bridges at the 
5'-termini thereof that render the blocker molecule nuclease- 
resistant. Particular applications may not require such 5' 
modifications of the blocker. For example, if the blocker- and 
primer-binding sites overlap, thereby precluding binding of the 
primer (e.g., with excess blocker), degradation of the blocker 
oligonucleotide will be substantially precluded. This is 
because the polymerase will not extend the primer toward, and 
through (in the 5'-3' direction) the blocker, a process that 
normally results in degradation of the hybridized blocker 
oligonucleotide. 

A particularly preferred blocker/PCR embodiment, for 
purposes of the present invention and as implemented herein, 
comprises the use of peptide nucleic acid (PNA) oligomers as 
blocking oligonucleotides. Such PNA blocker oligomers are 
ideally suited, because they are neither decomposed nor 
extended by the polymerase. 

Preferably, therefore, the base sequence of said blocking 
oligonucleotides is required to comprise a sequence having a 
length of at least 9 nucleotides which hybridizes to a 
pretreated nucleic acid sequence according to one of SEQ ID NO: 
16 to SEQ ID NO: 75 and sequences complementary thereto, 
wherein the base sequence of said oligonucleotides comprises at 
least one CpG, TpG or CpA dinucleotide. 

The fragments obtained by means of the amplification can 
carry a directly or indirectly detectable label. Preferred are 
labels in the form of fluorescence labels, radionuclides, or 
detachable molecule fragments having a typical mass which can 
be detected in a mass spectrometer. Where said labels are mass 
labels, it is preferred that the labeled amplificates have a 
single positive or negative net charge, allowing for better 
detectability in the mass spectrometer. The detection may be 
carried out and visualized by means of, e.g., matrix assisted 
laser desorption/ionization mass spectrometry (MALDI) or using 
electrospray mass spectrometry (ESI). 

Matrix Assisted Laser Desorption/Ionization Mass 
Spectrometry (MALDI-TOF) is a very efficient development for 
the analysis of biomolecules (Karas & Hillenkamp, Anal Chem., 
60:2299-301, 1988). An analyte is embedded in a light- 
absorbing matrix. The matrix is evaporated by a short laser 
pulse thus transporting the analyte molecule into the vapour 
phase in an unfragmented manner. The analyte is ionized by 
collisions with matrix molecules. An applied voltage 
accelerates the ions into a field-free flight tube. Due to 
their different masses, the ions are accelerated at different 
rates. Smaller ions reach the detector sooner than bigger 
ones. MALDI-TOF spectrometry is well suited to the analysis of 
peptides and proteins. The analysis of nucleic acids is 
somewhat more difficult (Gut & Beck, Current Innovations and 
Future Trends, 1:147-57, 1995). The sensitivity with respect 
to nucleic acid analysis is approximately 100-times less than 
for peptides, and decreases disproportionally with increasing 
fragment size. Moreover, for nucleic acids having a multiply 
negatively charged backbone, the ionization process via the 
matrix is considerably less efficient. In MALDI-TOF 
spectrometry, the selection of the matrix plays an eminently 
important role. For desorption of peptides, several very 
efficient matrixes have been found which produce a very fine 
crystallisation. There are now several responsive matrixes for 
DNA, however, the difference in sensitivity between peptides 
and nucleic acids has not been reduced. This difference in 
sensitivity can be reduced, however, by chemically modifying 
the DNA in such a manner that it becomes more similar to a 
peptide. For example, phosphorothioate nucleic acids, in which 
the usual phosphates of the backbone are substituted with 
thiophosphates, can be converted into a charge-neutral DNA 
using simple alkylation chemistry (Gut & Beck, Nucleic Acids 
Res. 23: 1367-73, 1995). The coupling of a charge tag to this 
modified DNA results in an increase in MALDI-TOF sensitivity to 
the same level as that found for peptides. A further advantage 
of charge tagging is the increased stability of the analysis 
against impurities, which makes the detection of unmodified 
substrates considerably more difficult. 
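
The time-of-flight separation described above can be sketched 
with the standard relation between kinetic energy and 
accelerating voltage, z·e·U = m·v²/2, so that flight time grows 
with the square root of mass over charge. The function name and 
the instrument parameters (20 kV, 1 m flight tube) are 
hypothetical values for illustration only and do not describe 
any particular instrument. 

```python
# Illustrative sketch only: idealised flight time in a field-free tube.
# Instrument parameters below are assumptions, not values from the text.
import math

DALTON_KG = 1.66053906660e-27        # atomic mass constant in kg
ELEMENTARY_CHARGE_C = 1.602176634e-19  # elementary charge in coulombs


def flight_time_s(mass_da: float, charge: int,
                  accel_voltage_v: float, tube_length_m: float) -> float:
    """Flight time for an ion accelerated through a fixed voltage."""
    mass_kg = mass_da * DALTON_KG
    energy_j = charge * ELEMENTARY_CHARGE_C * accel_voltage_v
    velocity = math.sqrt(2.0 * energy_j / mass_kg)
    return tube_length_m / velocity


if __name__ == "__main__":
    # Hypothetical singly charged ions of 1000 Da and 4000 Da.
    for mass in (1000.0, 4000.0):
        print(mass, flight_time_s(mass, 1, 20000.0, 1.0))
```

In this idealised model an ion of four times the mass takes 
twice as long to reach the detector, which is why smaller ions 
arrive first. 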

In the fourth step of the method, the amplificates 
obtained during the third step of the method are analysed in 
order to ascertain the methylation status of the CpG 
dinucleotides prior to the treatment. 

In embodiments where the amplificates were obtained by 
means of MSP amplification, the presence or absence of an 
amplificate is in itself indicative of the methylation state of 
the CpG positions covered by the primer, according to the base 
sequences of said primer. 
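
As a minimal sketch of this readout, assuming one reaction with 
primers specific for methylated DNA and one with primers 
specific for unmethylated DNA, the presence or absence of each 
amplificate can be mapped to a call. The function name and the 
wording of the calls are hypothetical and are not taken from 
the disclosure. 

```python
# Illustrative sketch only: interpret presence/absence of MSP amplificates.
# The function name and the returned labels are assumptions.


def interpret_msp(methylated_product: bool, unmethylated_product: bool) -> str:
    """Map presence of the two MSP amplificates to a methylation call."""
    if methylated_product and unmethylated_product:
        return "methylated and unmethylated templates present"
    if methylated_product:
        return "methylated"
    if unmethylated_product:
        return "unmethylated"
    return "no amplificate (uninformative)"


if __name__ == "__main__":
    print(interpret_msp(True, False))
```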

Amplificates obtained by means of both standard and 
methylation specific PCR may be further analyzed by means of 
hybridization-based methods such as, but not limited to, array 
technology and probe based technologies as well as by means of 
techniques such as sequencing and template directed extension. 

In one embodiment of the method, the amplificates 
synthesised in step three are subsequently hybridized to an 
array or a set of oligonucleotides and/or PNA probes. In this 
context, the hybridization takes place in the following manner: 
the set of probes used during the hybridization is preferably 
composed of at least 2 oligonucleotides or PNA-oligomers; in 
the process, the amplificates serve as probes which hybridize 
to oligonucleotides previously bonded to a solid phase; the 
non-hybridized fragments are subsequently removed; said 
oligonucleotides contain at least one base sequence having a 
length of at least 9 nucleotides which is reverse complementary 
or identical to a segment of the base sequences specified in 
the present Sequence Listing; and the segment comprises at 
least one CpG, TpG or CpA dinucleotide. 

In a preferred embodiment, said dinucleotide is present in 
the central third of the oligomer. For example, wherein the 
oligomer comprises one CpG dinucleotide, said dinucleotide is 
preferably the fifth to ninth nucleotide from the 5'-end of a 
13-mer. One oligonucleotide exists for the analysis of each 
CpG dinucleotide within the sequence according to SEQ ID NO: 1 
to SEQ ID NO: 15, and the equivalent positions within SEQ ID 
NO: 16 to SEQ ID NO: 75. Said oligonucleotides may also be 
present in the form of peptide nucleic acids. The non- 
hybridized amplificates are then removed. The hybridized 
amplificates are then detected. In this context, it is 
preferred that labels attached to the amplificates are 
identifiable at each position of the solid phase at which an 
oligonucleotide sequence is located. 

In yet a further embodiment of the method, the genomic 
methylation status of the CpG positions may be ascertained by 
means of oligonucleotide probes that are hybridised to the 
bisulfite treated DNA concurrently with the PCR amplification 
primers (wherein said primers may either be methylation 
specific or standard). 

A particularly preferred embodiment of this method is the 
use of fluorescence-based Real Time Quantitative PCR (Heid et 
al., Genome Res. 6:986-994, 1996; also see United States Patent 
No. 6,331,393) employing a dual-labeled fluorescent 
oligonucleotide probe (TaqMan™ PCR, using an ABI Prism 7700 
Sequence Detection System, Perkin Elmer Applied Biosystems, 
Foster City, California). The TaqMan™ PCR reaction employs the 
use of a nonextendible interrogating oligonucleotide, called a 
TaqMan™ probe, which, in preferred embodiments, is designed to 
hybridize to a GpC-rich sequence located between the forward 
and reverse amplification primers. The TaqMan™ probe further 
comprises a fluorescent "reporter moiety" and a "quencher 
moiety" covalently bound to linker moieties (e.g., 
phosphoramidites) attached to the nucleotides of the TaqMan™ 
oligonucleotide. For analysis of methylation within nucleic 
acids subsequent to bisulfite treatment, it is required that 
the probe be methylation specific, as described in United 
States Patent No. 6,331,393, (hereby incorporated by reference 
in its entirety) also known as the MethyLight™ assay. 
Variations on the TaqMan™ detection methodology that are also 
suitable for use with the described invention include the use 
of dual-probe technology (LightCycler™) or fluorescent 
amplification primers (Sunrise™ technology). Both these 
techniques may be adapted in a manner suitable for use with 
bisulfite treated DNA, and moreover for methylation analysis 
within CpG dinucleotides. 

A further suitable method for the use of probe 
oligonucleotides for the assessment of methylation by analysis 
of bisulfite treated nucleic acids is template-directed 
oligonucleotide extension. In a further preferred 
embodiment of the method, the fifth step of the method 
comprises the use of template-directed oligonucleotide 
extension, such as MS-SNuPE as described by Gonzalgo & Jones, 
Nucleic Acids Res. 25:2529-2531, 1997. 

In yet a further embodiment of the method, the fifth step 
of the method comprises sequencing and subsequent sequence 
analysis of the amplificate generated in the third step of the 
method (Sanger F., et al., Proc Natl Acad Sci USA 74:5463-5467, 
1977). 

Best mode 

In the most preferred embodiment of the method the 
nucleic acids according to SEQ ID NO: 1 to SEQ ID NO: 15 are 
isolated and treated according to the first three steps of the 
method outlined above, namely: 

a. obtaining, from a subject, a biological sample having 
subject genomic DNA; 

b. extracting or otherwise isolating the genomic DNA; 

c. treating the genomic DNA of b), or a fragment 
thereof, with one or more reagents to convert 
cytosine bases that are unmethylated in the 5- 
position thereof to uracil or to another base that is 
detectably dissimilar to cytosine in terms of 
hybridization properties; 

and wherein the subsequent amplification of d) is carried out 
in a methylation specific manner, namely by use of methylation 
specific primers or blocking oligonucleotides, and further 
wherein the detection of the amplificates is carried out by 
means of real-time detection probes, as described above. 

Wherein the subsequent amplification of d) is carried out 
by means of methylation specific primers, as described above, 
said methylation specific primers comprise a sequence having a 
length of at least 9 nucleotides which hybridizes to a 
pretreated nucleic acid sequence according to one of SEQ ID NO: 
16 to SEQ ID NO: 75 and sequences complementary thereto, 
wherein the base sequence of said oligomers comprises at least 
one CpG dinucleotide. 

Step e) of the method, namely the detection of the 
specific amplificates indicative of the methylation status of 
one or more CpG positions according to SEQ ID NO: 1 to SEQ ID 
NO: 15 is carried out by means of real-time detection methods 
as described above. 

In an alternative most preferred embodiment of the method the 
subsequent amplification of d) is carried out in the presence 
of blocking oligonucleotides, as described above. Said blocking 
oligonucleotides comprise a sequence having a length of at 
least 9 nucleotides which hybridizes to a pretreated nucleic 
acid sequence according to one of SEQ ID NO: 16 to SEQ ID NO: 
75 and sequences complementary thereto, wherein the base 
sequence of said oligomers comprises at least one CpG, TpG or 
CpA dinucleotide. 
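
The two sequence requirements stated above (at least 9 nucleotides with at least one CpG for a methylation specific primer; at least 9 nucleotides with at least one CpG, TpG or CpA for a blocking oligonucleotide) can be checked with a short sketch. The function names are hypothetical, and the check covers only length and dinucleotide content; it does not model hybridization to SEQ ID NO: 16 to SEQ ID NO: 75.

```python
def meets_primer_criteria(oligo: str, min_length: int = 9) -> bool:
    """Length of at least min_length and at least one CpG dinucleotide."""
    seq = oligo.upper()
    return len(seq) >= min_length and "CG" in seq


def meets_blocker_criteria(oligo: str, min_length: int = 9) -> bool:
    """Length of at least min_length and at least one CpG, TpG or CpA."""
    seq = oligo.upper()
    has_dinucleotide = any(d in seq for d in ("CG", "TG", "CA"))
    return len(seq) >= min_length and has_dinucleotide
```

For example, the forward primer listed for genomic SEQ ID NO: 3 in Table 2 (agggcgttttgttggcg) passes the primer check, since it is 17 nucleotides long and contains CpG dinucleotides.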

Step e) of the method, namely the detection of the 
specific amplificates indicative of the methylation status of 
one or more CpG positions according to SEQ ID NO: 1 to SEQ ID 
NO: 15 is carried out by means of real-time detection methods 
as described above. 

Additional embodiments of the invention provide a method 
for the analysis of the methylation status of genomic DNA 
according to the invention (SEQ ID NO: 1 to SEQ ID NO: 15, and 
complements thereof) without the need for pretreatment. 

In the first step of such additional embodiments, the genomic 
DNA sample is isolated from tissue or cellular sources. 
Preferably, such sources include cell lines, histological 
slides, body fluids, or tissue embedded in paraffin. In the 
second step, the genomic DNA is extracted. Genomic DNA is 
isolated from said biological sample by any means 
standard in the art, including the use of commercially 
available kits. Briefly, where the DNA of interest is 
encapsulated by a cellular membrane, the biological sample 
must be disrupted and lysed by enzymatic, chemical or 
mechanical means. The DNA solution may then be cleared of 
proteins and other contaminants, e.g. by digestion with 
proteinase K. The genomic DNA is then recovered from the 
solution. This may be carried out by means of a variety of 
methods including salting out, organic extraction or binding of 
the DNA to a solid phase support. The choice of method will be 
affected by several factors including time, expense and 
required quantity of DNA. 

In a preferred embodiment, the DNA may be cleaved prior to 
the treatment, and this may be by any means standard in the 
state of the art, in particular with methylation-sensitive 
restriction endonucleases. 

In the third step, the DNA is then digested with one or 
more methylation sensitive restriction enzymes. The digestion 
is carried out such that hydrolysis of the DNA at the 
restriction site is informative of the methylation status of a 
specific CpG dinucleotide. 
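
How cleavage at a restriction site reports on methylation can be sketched as follows. The text names no particular enzyme; HpaII (recognition site CCGG, cleavage blocked when the internal cytosine is methylated) is used here purely as an illustrative assumption, and the function name is hypothetical.

```python
def hpaii_cut_sites(seq: str, methylated_positions: set) -> list:
    """Return 0-based start indices of CCGG sites that would be cleaved.

    A site is treated as cleavable only if its internal cytosine (the C of
    the CpG, at start + 1) is NOT in methylated_positions.
    """
    seq = seq.upper()
    cuts = []
    start = seq.find("CCGG")
    while start != -1:
        internal_c = start + 1
        if internal_c not in methylated_positions:
            cuts.append(start)
        start = seq.find("CCGG", start + 1)
    return cuts
```

A site that remains uncut after digestion is therefore read as methylated at its CpG, and a cut site as unmethylated.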

In the fourth step, which is optional but a preferred 
embodiment, the restriction fragments are amplified. This is 
preferably carried out using a polymerase chain reaction, and 
said amplificates may carry suitable detectable labels as 
discussed above, namely fluorophore labels, radionuclides and 
mass labels. 

In the fifth step the amplificates are detected. The 
detection may be by any means standard in the art, for example, 
but not limited to, gel electrophoresis analysis, hybridization 
analysis, incorporation of detectable tags within the PCR 
products, DNA array analysis, MALDI or ESI analysis. 

In the final step of the method the presence, absence 
or subclass of colon cell proliferative disorder is deduced 
based upon the methylation state of at least one CpG 
dinucleotide sequence of SEQ ID NO: 1 to SEQ ID NO: 15, or an 
average, or a value reflecting an average methylation state of 
a plurality of CpG dinucleotide sequences of SEQ ID NO: 1 to 
SEQ ID NO: 15. 
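
The deduction from an average methylation state of a plurality of CpG positions can be sketched as below. This is a minimal illustration only: the per-position methylation fractions, the function names and the cut-off value are hypothetical, since the text does not specify a threshold or a scoring scheme.

```python
from statistics import mean


def average_methylation(cpg_levels: list) -> float:
    """Mean methylation fraction (0.0 to 1.0) over several CpG positions."""
    return mean(cpg_levels)


def classify(cpg_levels: list, threshold: float) -> str:
    """Label a sample by comparing its average methylation to a cut-off.

    The threshold is a hypothetical parameter; it would have to be set
    against control samples or a known standard.
    """
    if average_methylation(cpg_levels) >= threshold:
        return "methylated"
    return "unmethylated"
```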

Monitoring, Diagnostic and/or Prognostic Assays for colon cell 
proliferative disorders 

The present invention enables monitoring, diagnosis and/or 
prognosis of events which are disadvantageous to patients or 
individuals in which important genetic and/or epigenetic 
parameters within one or more of SEQ ID NO: 1 to SEQ ID NO: 15 
may be used as markers. Said parameters obtained by means of 
the present invention may be compared to another set of genetic 
and/or epigenetic parameters, the differences serving as the 
basis for a diagnosis and/or prognosis of events which are 
disadvantageous to patients or individuals. 

Specifically, the present invention provides for 
monitoring, diagnostic and/or prognostic cancer assays based on 
measurement of differential methylation of one or more CpG 
dinucleotide sequences of SEQ ID NO: 1 to SEQ ID NO: 15, or of 
subregions thereof that comprise such a CpG dinucleotide 
sequence. Typically, such assays involve obtaining a tissue 
sample from a test tissue, performing an assay to measure the 
methylation status of at least one of one or more CpG 
dinucleotide sequences of SEQ ID NO: 1 to SEQ ID NO: 15 derived 
from the tissue sample, relative to a control sample or a 
known standard, and making a diagnosis or prognosis based 
thereon. 

In particular preferred embodiments, inventive oligomers 
are used to assess the CpG dinucleotide methylation status, 
such as those based on SEQ ID NO: 1 to SEQ ID NO: 75, or arrays 
thereof, as well as in kits based thereon and useful for the 
diagnosis and/or prognosis of colon cell proliferative 
disorders. 

Kits 

Moreover, an additional aspect of the present invention is 
a kit comprising, for example: a bisulfite-containing reagent; 
a set of primer oligonucleotides containing at least two 
oligonucleotides whose sequences in each case correspond, are 
complementary, or hybridize under stringent or highly stringent 
conditions to a 16-base long segment of the sequences SEQ ID 
NO: 1 to SEQ ID NO: 75; oligonucleotides and/or PNA-oligomers; 
as well as instructions for carrying out and evaluating the 
described method. In a further preferred embodiment, said kit 
may further comprise standard reagents for performing a CpG 
position-specific methylation analysis, wherein said analysis 
comprises one or more of the following techniques: MS-SNuPE, 
MSP, MethyLight™, HeavyMethyl™, COBRA, and nucleic acid 
sequencing. However, a kit along the lines of the present 
invention can also contain only part of the aforementioned 
components. 

Typical reagents (e.g., as might be found in a typical 
COBRA-based kit) for COBRA analysis may include, but are not 
limited to: PCR primers for specific gene (or 
methylation-altered DNA sequence or CpG island); restriction enzyme and 
appropriate buffer; gene-hybridization oligo; control 
hybridization oligo; kinase labeling kit for oligo probe; and 
radioactive nucleotides. Additionally, bisulfite conversion 
reagents may include: DNA denaturation buffer; sulfonation 
buffer; DNA recovery reagents or kits (e.g., precipitation, 
ultrafiltration, affinity column); desulfonation buffer; and 
DNA recovery components. 

Typical reagents (e.g., as might be found in a typical 
MethyLight™-based kit) for MethyLight™ analysis may include, 
but are not limited to: PCR primers for specific gene (or 
methylation-altered DNA sequence or CpG island); LightCycler® 
and/or TaqMan® probes; optimized PCR buffers and 
deoxynucleotides; and Taq polymerase. 

Typical reagents (e.g., as might be found in a typical 
Ms-SNuPE-based kit) for Ms-SNuPE analysis may include, but are not 
limited to: PCR primers for specific gene (or 
methylation-altered DNA sequence or CpG island); optimized PCR buffers and 
deoxynucleotides; gel extraction kit; positive control primers; 
Ms-SNuPE primers for specific gene; reaction buffer (for the 
Ms-SNuPE reaction); and radioactive nucleotides. Additionally, 
bisulfite conversion reagents may include: DNA denaturation 
buffer; sulfonation buffer; DNA recovery reagents or kit (e.g., 
precipitation, ultrafiltration, affinity column); desulfonation 
buffer; and DNA recovery components. 

Typical reagents (e.g., as might be found in a typical 
MSP-based kit) for MSP analysis may include, but are not 
limited to: methylated and unmethylated PCR primers for 
specific gene (or methylation-altered DNA sequence or CpG 
island), optimized PCR buffers and deoxynucleotides, and 
specific probes. 

While the present invention has been described with specificity 
in accordance with certain of its preferred embodiments, the 
following example serves only to illustrate the invention and 
is not intended to limit the invention within the principles 
and scope of the broadest interpretations and equivalent 
configurations thereof. 



Table 1 

Gene / Genomic location | Genomic SEQ ID | Treated methylated SEQ IDs | Treated unmethylated SEQ IDs 
[illegible] | 1 | 16 & 17 | 46 & 47 
RING FINGER PROTEIN 4 | 2 | 18 & 19 | 48 & 49 
TUMOR NECROSIS FACTOR RECEPTOR SUPERFAMILY MEMBER 1 o PRECURSOR | 3 | 20 & 21 | 
Region of chromosome 14Q31.1 | 4 | 22 & 23 | 52 & 53 
Chondroitin sulfate proteoglycan 2 (versican) (CPSG2) | 5 | 24 & 25 | 54 & 55 
EYA 4 | 6 | 26 & 27 | 56 & 57 
MOTHERS AGAINST DECAPENTAPLEGIC HOMOLOG 7 (SMAD 7) | 7 | 28 & 29 | 58 & 59 
ENDOPLASMIN PRECURSOR (94 KDA GLUCOSE-REGULATED PROTEIN) (GRP94) (GP96 HOMOLOG) (TUMOR REJECTION ANTIGEN 1) | 8 | 30 & 31 | 60 & 61 
Region of chromosome | 9 | [illegible] | 62 & 63 
B-CELL LYMPHOMA 6 PROTEIN (BCL-6) (ZINC FINGER PROTEIN 51) [illegible] 5) | 10 | 34 & 35 | 64 & 65 
[illegible] ARISTALESS-LIKE 4 | 11 | 36 & 37 | 66 & 67 
WITH EGF-LIKE AND TWO FOLLISTATIN-LIKE DOMAINS 2 | 12 | 38 & 39 | 68 & 69 
Hypothetical protein-leucine rich repeat | 13 | 40 & 41 | 70 & 71 
Candidate tumor suppressor 8p22 | 14 | 42 & 43 | 72 & 73 
VESICULAR INHIBITORY AMINO ACID TRANSPORTER (GABA AND GLYCINE TRANSPORTER) (VESICULAR GABA TRANSPORTER) (HVIAAT) | 15 | 44 & 45 | 74 & 75 


EXAMPLES 



MSP analysis of the genes according to Table 1. 

In the following analysis the methylation status of the 
genes according to Table 1 was analysed by means of 
methylation specific amplification using the primers according 
to Table 2 (below). 

The study was run on approximately 140 samples from colon, 
breast and liver carcinoma, normal tissue (peripheral blood 
lymphocytes, healthy colon and healthy tissue adjacent to 
colon carcinoma), colon polyps and other colon diseases. 
Genomic DNA was analyzed using the MSP technique after 
bisulfite conversion. Total genomic DNA of all samples was 
bisulfite treated converting unmethylated cytosines to 
uracil. Methylated cytosines remained conserved. Bisulfite 
treatment was performed with minor modifications according 
to the protocol described in Olek et al. (1996). 
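
The effect of the bisulfite conversion described above can be sketched in silico. This is a simplified model, not the laboratory protocol: every unmethylated cytosine is written as t (uracil is read as thymine after amplification), every methylated cytosine is left unchanged, and the function names and the position-set representation of methylation are assumptions.

```python
def bisulfite_convert(seq: str, methylated_positions=frozenset()) -> str:
    """Convert unmethylated c to t; cytosines at methylated positions stay c.

    Positions are 0-based indices into seq. The converted base is written
    as t, since uracil pairs like thymine in the subsequent amplification.
    """
    converted = []
    for i, base in enumerate(seq.lower()):
        if base == "c" and i not in methylated_positions:
            converted.append("t")
        else:
            converted.append(base)
    return "".join(converted)


def fully_methylated_cpg_positions(seq: str) -> set:
    """Indices of every cytosine that is followed by g (all CpGs methylated)."""
    s = seq.lower()
    return {i for i in range(len(s) - 1) if s[i:i + 2] == "cg"}
```

Converting the same genomic stretch once with no methylated positions and once with all CpG cytosines methylated gives two distinct treated sequences, which is the distinction that methylation specific primers and probes exploit.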

The sequence of interest was then amplified by means of 
methylation specific primers, and the amplificate was then 
detected by means of methylation specific TaqMan probes. 



Table 2: Oligonucleotides for MSP TaqMan. 

Genomic SEQ ID NO: | Forward primer sequence | Probe | Reverse primer sequence 
1 | acggcgttttggttcgt SEQ ID NO: 76 | cgacttaataacgacgtacgcgaaccc SEQ ID NO: 77 | gtcatctaacaacgacctaactaacg SEQ ID NO: 78 
2 | ctctaaaccgcgaaaactccg SEQ ID NO: 79 | cgcgtttgttcgggtttcggg SEQ ID NO: 80 | gggatgagaggttgttgacgttc SEQ ID NO: 81 
3 | agggcgttttgttggcg SEQ ID NO: 82 | atcggcgttttagcgtgcggg SEQ ID NO: 83 | ctacaacctaaacgacgcgct SEQ ID NO: 84 
4 | gaggtttcggttcgaggttc SEQ ID NO: 85 | acgtcacaaaaaaaaaacccacgtaataataacgaa SEQ ID NO: 86 | ctttactacaacgaaaacgccg SEQ ID NO: 87 
5 | gattaaaaaatttgtttttattcggtcg SEQ ID NO: 88 | cgcggaagtagagtaggcgggcg SEQ ID NO: 89 | tccgcttcgaaaacctcga SEQ ID NO: 90 
6 | caaaatccgtcgacgataaaac SEQ ID NO: 91 | tcgggttggggttcgtattcgg SEQ ID NO: 92 | acgtagttataggttttgatagttttac SEQ ID NO: 93 
7 | gaacgcgaaaaaattaacatctcg SEQ ID NO: 94 | tttcgtttttatcgtggtgggtatgttcgtgttta SEQ ID NO: 95 | tttcgggattgttgttggtgttt SEQ ID NO: 96 
8 | catcaatatcctaatcataacctactatctatcg SEQ ID NO: 97 | taggtttagtcgcgcgttcgcga SEQ ID NO: 98 | gagatggggaattttttgacgtc SEQ ID NO: 99 
9 | cttaaaataataacgcgaaatataacgct SEQ ID NO: 100 | cgaaaaacgatcgaaatcccgactctc SEQ ID NO: 101 | tttgcggaaagacgtcgtt SEQ ID NO: 102 
10 | ccaaaaatacgcgaccacg SEQ ID NO: 103 | ctaaaccactaatattcgacgtaattctccgtatttac SEQ ID NO: 104 | cgttttttcgtattcggaaagac SEQ ID NO: 105 
11 | tacgtgtgaggggttcgc SEQ ID NO: 106 | ccaatcgaaactatccaaatacgataaccga SEQ ID NO: 107 | ccaataaaaattcgtaaaaaaaaacg SEQ ID NO: 108 
12 | cggttagtcggaggcgc SEQ ID NO: 109 | cgaacgaaacaacctaaacgaaaaccccg SEQ ID NO: 110 | ccaaaataaccaatcgacccg SEQ ID NO: 111 
13 | gggattaagattttcggttagtttcg SEQ ID NO: 112 | caaacgaatcacataaaaaacgtaataacccgaa SEQ ID NO: 113 | aacgctacgcgactaaattcga SEQ ID NO: 114 
14 | gggttagggtgggtcgc SEQ ID NO: 115 | cgtcggcgtgggcgatgtc SEQ ID NO: 116 | gactcccgccccaacg SEQ ID NO: 
15 | cggagggtacggagattacg SEQ ID NO: 118 | cgaaaccctaaatatcccgaataacgccg SEQ ID NO: 119 | cgacgacgcgcgaaa SEQ ID NO: 



Reaction conditions for TaqMan PCR program: 

Denaturation at 95°C for 10 minutes. 

50 cycles: 

Denaturation at 95°C for 15 seconds. 

Annealing at 62°C for 1 minute, except for SEQ ID NO: 15, 
which has an annealing temperature of 60°C. 
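
The program above can be written down as a small sketch that returns the annealing temperature per assay and the nominal hold time of the whole run. The function names are hypothetical, and ramp times between temperatures are ignored, so the result is a lower bound on the actual run time.

```python
def annealing_temperature_c(genomic_seq_id: int) -> int:
    """62 degrees C for all assays except SEQ ID NO: 15, which uses 60."""
    return 60 if genomic_seq_id == 15 else 62


def program_seconds(initial_denaturation_s: int = 600,
                    cycles: int = 50,
                    denaturation_s: int = 15,
                    annealing_s: int = 60) -> int:
    """Total hold time: initial denaturation plus all two-step cycles."""
    return initial_denaturation_s + cycles * (denaturation_s + annealing_s)
```

With the stated values the hold times add up to 600 s + 50 x 75 s = 4350 s, i.e. 72.5 minutes.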



Table 3: Results 

Genomic SEQ ID NO | AUC | Sensitivity | Specificity 
1 | 0.57 | 0.15 | 0.99 
2 | 0.54 | 0.91 | 0.09 
3 | 0.82 | 0.74 | 0.90 
4 | 0.83 | 0.69 | 0.94 
5 | 0.92 | 0.83 | 0.96 
6 | 0.83 | 0.71 | 0.94 
7 | 0.54 | 0.25 | 0.91 
8 | 0.6 | 0.11 | 0.99 
9 | 0.59 | 0.1 | 0.99 
10 | 0.62 | 0.11 | 0.98 
11 | 0.78 | 0.74 | 0.95 
12 | 0.88 | 0.84 | 0.94 
13 | 0.73 | 0.79 | 0.38 
14 | 0.79 | 0.64 | 0.76 
15 | 0.77 | 0.78 | 0.44 
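
The sensitivity and specificity columns of Table 3 follow the standard definitions, sketched below together with one possible way of ranking the markers. The ranking by Youden's index (sensitivity + specificity - 1) is an illustrative choice that the text itself does not use, and the function names are hypothetical; the table values are copied from Table 3.

```python
# (sensitivity, specificity) per genomic SEQ ID NO, copied from Table 3.
TABLE_3 = {
    1: (0.15, 0.99), 2: (0.91, 0.09), 3: (0.74, 0.90), 4: (0.69, 0.94),
    5: (0.83, 0.96), 6: (0.71, 0.94), 7: (0.25, 0.91), 8: (0.11, 0.99),
    9: (0.10, 0.99), 10: (0.11, 0.98), 11: (0.74, 0.95), 12: (0.84, 0.94),
    13: (0.79, 0.38), 14: (0.64, 0.76), 15: (0.78, 0.44),
}


def sensitivity(true_pos: int, false_neg: int) -> float:
    """Fraction of diseased samples that test positive."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg: int, false_pos: int) -> float:
    """Fraction of non-diseased samples that test negative."""
    return true_neg / (true_neg + false_pos)


def youden_j(sens: float, spec: float) -> float:
    """Youden's index: 0 for an uninformative test, 1 for a perfect one."""
    return sens + spec - 1.0


def rank_markers(table: dict) -> list:
    """Marker IDs sorted from highest to lowest Youden's index."""
    return sorted(table, key=lambda k: youden_j(*table[k]), reverse=True)
```

Under this particular summary measure, genomic SEQ ID NO: 5 and NO: 12 come out on top, in line with their having the two highest AUC values in the table.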











The following table gives the sensitivities and specificities 
of MSP assays analysing two genes. 



Table 4: Sensitivity and specificity of two member gene panel 
assays. 

Genomic SEQ ID NOs. | Sensitivity (%) | Specificity (%) 
9 & 12 | 88 | 97 
11 & 12 | 88 | 97 
12 & 15 | 90 | 95 
6 & 7 | 87 | 97 
5 & 11 | 89 | 94 
6 & 12 | 84 | 94 
12 & 3 | 80 | 98 
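
How two single-marker results might be combined into one panel result can be sketched as follows. The text does not state the combination rule used for Table 4; the "positive if either marker is positive" rule below is an assumption, and the function names and the per-sample call lists are hypothetical.

```python
def panel_calls(marker_a: list, marker_b: list) -> list:
    """Per-sample panel result: positive if either marker is positive."""
    return [a or b for a, b in zip(marker_a, marker_b)]


def sensitivity_specificity(calls: list, is_disease: list) -> tuple:
    """Sensitivity and specificity of boolean calls against true status."""
    tp = sum(1 for c, d in zip(calls, is_disease) if c and d)
    fn = sum(1 for c, d in zip(calls, is_disease) if not c and d)
    tn = sum(1 for c, d in zip(calls, is_disease) if not c and not d)
    fp = sum(1 for c, d in zip(calls, is_disease) if c and not d)
    return tp / (tp + fn), tn / (tn + fp)
```

Under an either-positive rule the panel sensitivity can only match or exceed that of each single marker, while the specificity can only match or fall below it.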



HeavyMethyl analysis of the genes according to SEQ ID NOs: 3, 4 
& 12 

In the following, methylation of the genomic sequences 
according to SEQ ID NOs: 3, 4 & 12 was analysed in 15 
colorectal cancer samples (referred to as 'colon' in Table 5), 
and eight peripheral blood lymphocyte samples (referred to as 
'PBL' in Table 5). 

Total genomic DNA of all samples was bisulfite treated 
converting unmethylated cytosines to uracil. Methylated 
cytosines remained conserved. Bisulfite treatment was performed 
with minor modifications according to the protocol described in 
Olek et al. (1996). 

The sequence of interest was then amplified by means of primers 
specific for bisulfite treated DNA and a blocking 
oligonucleotide (in order to suppress amplification of 
non-methylated DNA). The 3'OH terminus of the blocking 
oligonucleotide was "capped" by a phosphate group to prevent 
polymerase from extending the primer during PCR reaction. The 
amplificate was then detected by means of methylation specific 
LightCycler probes, wherein one probe was labelled with 
5'red640 and had a capped 3'OH terminus and the other was 
labelled with 3'Fluorescein. 

Genomic SEQ ID NO: 12 
Primer SEQ ID NOS: 121 & 122 
Blocking oligonucleotide SEQ ID NO: 123 
Probe SEQ ID NOS: 124 (Fluorescein) & 125 (red640) 

Genomic SEQ ID NO: 3 
Primer SEQ ID NOS: 126 & 127 
Blocking oligonucleotide SEQ ID NO: 128 
Probe SEQ ID NOS: 129 (Fluorescein) & 130 (red640) 

Genomic SEQ ID NO: 4 
Primer SEQ ID NOS: 131 & 132 
Blocking oligonucleotide SEQ ID NO: 133 
Probe SEQ ID NOS: 134 (Fluorescein) & 135 (red640) 

The reactions were performed in a total volume of 20 µl using a 
LightCycler device (Roche Diagnostics). The real time PCR 
reaction mix contained 10 µl of template DNA, 2 µl of 
FastStart LightCycler reaction mix for hybridization probes 
(Roche Diagnostics, Penzberg), 



Table 5: Results of HeavyMethyl assays 

Tissue | Crossing points, Genomic SEQ ID NO: 3 | Crossing points, Genomic SEQ ID NO: 12 | Genomic SEQ ID NO: 4 (undiluted) | Genomic SEQ ID NO: 4 (diluted) | Calculated concentration, DNA undiluted (pg/10 µl) 
colon | 36,73 | 38,22 | 39,58 | | 5.85 
colon | 33,1 | >51 | | | 386, 
colon | 39,8 | 42,84 | | | 57, 
colon | 39,74 | 38,75 | 32,54 | 33,89 | 482, 
colon | | >51 | | | 132,5 
colon | | 38,82 | | | 29,95 
colon | 32,08 | 33,9 | 33,43 | 35,43 | 81 
colon | 37,03 | 36,91 | | | 284,3 
colon | 38,07 | 46,54 | | | 
colon | 35,22 | 37,81 | 36,8 | 37,03 | 298,3 
colon | 33,66 | 36,19 | 33,23 | 34,01 | 135 
colon | 36,78 | 37,46 | 40,71 | | 87 
colon | | 37,23 | | | 
colon | | | | | 166 
colon | 34,88 | 34,71 | 34,21 | 34,49 | 880, 
PBL | | | | | 716. 
PBL | | | | | 79 
PBL | | | | | 584, 
PBL | 38,55 | | | | 181 
PBL | | | | | 547 
PBL | | 49,04 | | | 76 
PBL | | | | | 62 
PBL | | 39,75 | | | 983 
100pg | 36,34 | 37,29 | 34,95 | 34,51 | 583, 
100pg | 36,06 | 37,86 | 34,64 | 34,09 | 428. 
MRoche 100ng | 34,48 | | | | 
MPhi 100ng | | | | | 
Water | | | | | 

Bold type figures indicate those classified as positive because 
the amplification curves were flat. 
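
Crossing points such as those in Table 5 relate to starting template amount in that a lower crossing point indicates more template. A minimal sketch of that relationship follows, assuming ideal doubling of product per cycle; the function name and the efficiency parameter are hypothetical, and this is not the calculation used for the concentration column, which the text does not describe.

```python
def fold_difference(cp_reference: float, cp_sample: float,
                    efficiency: float = 2.0) -> float:
    """Estimated template ratio sample/reference from two crossing points.

    Assumes the amount of product multiplies by `efficiency` each cycle
    (2.0 = perfect doubling), so each cycle of difference is one factor.
    """
    return efficiency ** (cp_reference - cp_sample)
```

For instance, a sample crossing three cycles earlier than a reference would be estimated to contain eight times as much amplifiable template under perfect doubling.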

Serum analysis of the genes according to SEQ ID NOs: 3, 6, 
14 & 12. 

The assays as described above were then applied to a set of 
serum samples consisting of 32 colorectal carcinoma serum samples, 
and compared to a set consisting of 54 normal serum samples, 
11 serum samples from benign colon lesions and 6 serum samples 
from inflammatory colon disease. 130 µl of serum was used per 
assay and the results were quantitatively reported as positive 
or negative. 

Table 6 

Marker or combination of markers | Sensitivity | Specificity 
12 (HeavyMethyl) | 25 | 92.9 
12 (MSP) | 40.6 | 94.3 
6 (MSP) | 21.9 | 97.1 
14 (MSP) | 37.5 | 77.6 
3 (MSP) | 15.6 | 98.3 
12, 3 & 6 | 50 | 95.7 
12, 6 & 14 | 59.4 | 78.9 
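
The sensitivities in Table 6 can be related back to sample counts with a short sketch, assuming that each marker was measured on all 32 colorectal carcinoma serum samples described above (an assumption; the text does not give per-marker sample numbers). The function name is hypothetical.

```python
def positives_from_percent(percent: float, n_samples: int) -> int:
    """Nearest whole number of positive samples for a reported percentage."""
    return round(percent / 100 * n_samples)
```

Under that assumption, a sensitivity of 40.6 corresponds to 13 of 32 samples and a sensitivity of 59.4 to 19 of 32 samples.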



We claim: 

1. A method for detecting, or for detecting and 
distinguishing between or among colon cell proliferative 
disorders in a subject, comprising contacting genomic DNA 
obtained from the subject with at least one reagent, or 
series of reagents that distinguishes between methylated 
and non-methylated CpG dinucleotides within at least one 
target region of the genomic DNA, wherein the target 
region comprises, or hybridizes under stringent conditions 
to at least 16 contiguous nucleotides of a sequence 
selected from the group consisting of SEQ ID NO: 1 to SEQ 
ID NO 15 respectively, wherein said contiguous nucleotides 
comprise at least one CpG dinucleotide sequence, and 
whereby detecting, or detecting and distinguishing between 
or among colon cell proliferative disorders is, at least 
in part, afforded. 

2. A method according to claim 1 for detecting, or for 
detecting and distinguishing between or among colon cell 
proliferative disorders in a subject, comprising: 
-obtaining, from a subject, a biological sample having 
subject genomic DNA; 
-contacting the genomic DNA, or a fragment thereof, with 
one reagent or a plurality of reagents for distinguishing 
between methylated and non methylated CpG dinucleotide 
sequences within at least one target sequence of the 
genomic DNA, or fragment thereof, wherein the target 
sequence comprises, or hybridizes under stringent 
conditions to, at least 16 contiguous nucleotides of a 
sequence taken from the group consisting of SEQ ID NO: 1 
to SEQ ID NO 15, said contiguous nucleotides comprising at 
least one CpG dinucleotide sequence; and 
-determining, based at least in part on said 
distinguishing, the methylation state of at least one 
target CpG dinucleotide sequence, or an average, or a 
value reflecting an average methylation state of a 
plurality of target CpG dinucleotide sequences, whereby 
detecting, or detecting and distinguishing between or 
among colon cell proliferative disorders is, at least in 
part, afforded. 

3. The method of claims 1 and 2, wherein detecting, or 
detecting and distinguishing between or among colon cell 
proliferative disorders comprises detecting, or detecting 
and distinguishing between or among one or more tissues 
selected from the group consisting of colorectal 
carcinoma, colon adenoma, inflammatory colon tissue, 
grade 2 dysplasia colon adenomas less than 1 cm, grade 3 
dysplasia colon adenomas larger than 1 cm, normal colon 
tissue, non-colon normal tissue, body fluids and non-colon 
cancer tissue. 

4. The method of claim 2, wherein distinguishing between 
methylated and non methylated CpG dinucleotide sequences 
within the target sequence comprises converting 
unmethylated cytosine bases within the target sequence to 
uracil or to another base that is detectably dissimilar to 
cytosine in terms of hybridization properties. 

5. The method of claim 2, wherein distinguishing between 
methylated and non methylated CpG dinucleotide sequences 
within the target sequence(s) comprises methylation 
state-dependent conversion or non-conversion of at least one CpG 
dinucleotide sequence to the corresponding converted or 
non-converted dinucleotide sequence within a sequence 
selected from the group consisting of SEQ ID NO: 16 to SEQ 
ID NO 75, and contiguous regions thereof corresponding to 
the target sequence. 

6. The method of claim 2, wherein distinguishing between 
methylated and non methylated CpG dinucleotide sequences 
within the target sequence comprises use of at least one 
nucleic acid molecule or peptide nucleic acid (PNA) 
molecule comprising, in each case a contiguous sequence at 
least 9 nucleotides in length that is complementary to, or 
hybridizes under moderately stringent or stringent 
conditions to a sequence selected from the group 
consisting of SEQ ID NO: 16 to SEQ ID NO 75, and 
complements thereof. 

7. The method of claim 2, wherein the contiguous sequence 
comprises at least one CpG, TpG or CpA dinucleotide 
sequence. 

8. A method according to claims 1 to 3 for detecting, or 
detecting and distinguishing between or among colon cell 
proliferative disorders in a subject, comprising: 

a. obtaining, from a subject, a biological sample having 
subject genomic DNA; 

b. extracting or otherwise isolating the genomic DNA; 

c. treating the genomic DNA of b), or a fragment 
thereof, with one or more reagents to convert 
cytosine bases that are unmethylated in the 5- 
position thereof to uracil or to another base that is 
detectably dissimilar to cytosine in terms of 
hybridization properties; 

d. contacting the treated genomic DNA, or the treated 
fragment thereof, with an amplification enzyme and at 
least two primers comprising, in each case a 
contiguous sequence of at least 9 nucleotides that is 
complementary to, or hybridizes under moderately 
stringent or stringent conditions to a sequence 
selected from the group consisting of SEQ ID NO: 16 
to SEQ ID NO 75, and complements thereof, wherein the 
treated genomic DNA or the fragment thereof is either 
amplified to produce at least one amplificate, or is 
not amplified; and 

e. determining, based on a presence or absence of, or 
on a property of said amplificate, the methylation 
state of at least one CpG dinucleotide of a sequence 
selected from the group consisting SEQ ID NO: 1 to 
SEQ ID NO 15, or an average, or a value reflecting an 
average methylation state of a plurality of CpG 
dinucleotides of a sequence selected from the groups 
consisting of SEQ ID NO: 1 to SEQ ID NO 15, whereby 
at least one of detecting, or detecting and 
distinguishing between colon cell proliferative 
disorders is, at least in part, afforded. 

9. The method of claim 8, further comprising in step d) the 
use of at least one nucleic acid molecule or peptide 
nucleic acid molecule comprising in each case a contiguous 
sequence at least 9 nucleotides in length that is 
complementary to, or hybridizes under moderately stringent 
or stringent conditions to a sequence selected from the 
group consisting of SEQ ID NO: 16 to SEQ ID NO 75, and 
complements thereof, wherein said nucleic acid molecule or 
peptide nucleic acid molecule suppresses amplification of 
the nucleic acid to which it is hybridized. 



10. The method of claim 8, wherein contacting or 
amplifying in d) comprises use of methylation-specific 
primers. 

Abstract 

The invention provides methods, nucleic acids and kits for 
detecting, or for detecting and distinguishing between or among 
colon cell proliferative disorders. The invention discloses 
genomic sequences the methylation patterns of which have 
utility for the improved detection of and differentiation 
between said class of disorders, thereby enabling the improved 
diagnosis and treatment of patients. 

<110> Epigenomics AG 

<120> Methods and nucleic acids for the improved detection of colon cell 
proliferative disorders. 

<160> 135 

<210> 1 
<211> 3049 
<212> DNA 

<213> Homo Sapiens 
<400> 1 

tactcggtgt agctggtcca gatcttgtcc tcgctctggt cggtgctgct ggcaaaggcg 60 

caggtgcccg tggagctgca cgccaccatg tggaagcccg actcggacag cttgtcgaag 120 

gcctgctcca ggaagttgaa cttgaggtaa tagcgcgagg tgtagcgctc cgggggacgg 180 

tcggggtccc ggctttcgtt cagggtgtcc ccaaacacct ccttggccag cgacgtcttt 240 

ccgcaaacgg tgatgcgcgc cactcgccgg aacttggcgt ccgcctgcgc gtcccgcccg 300 

atggtgtagg agccgcggta gccgatggtg atgtagcccg agcgccggct gccgtccagc 360 

gactgggacg gcgtgagcag cgggcccgcc gcgcccccgg acggactgcg gctagccagc 420 

tccagcgtgg gcgacggcgc cccggcagag gcgccctcct gctgttcggg ctccgagtag 480 

ccaagcggca gcagctcgtc acccagcgag ccctccttgt gcaccccgcg ccgcgagggc 540 

ggcggccccg ggccgggctg ctggggcgcc ccgaggcggc gcacgagctc tggcagctcg 600 

aagtactcgg cctcgcgctg cagccggctg cgctcgggga agtagtcggg cagcacgagc 660 

tgcaagtccc gcaggtaatc caggatgtag cggaagagga agccgtcccg gtccagaaag 720 

aagcggcctt tgctgtcccg ggccagctcc tgcggctgct gctgcgtgaa catgcgccag 780 

agcagcgagt cgggcaccga caccaccgtg cagcgccggg tcacgtacac ctggcccccc 840 

acgttcagct ccacgatgtc ggggaagagc ggtggctccg cggaggacga cgaggagcca 900 

ctgccgcccc cgccgccgcc cccgttgggt aatccacgtg tgctgtccgc cagagccatg 960 

acagagaggt ggccgggccg ggacagtggc aggaagccgc gctgcactca ggagctgcaa 1020 

ccgccttccc cggagccccg gaacccggac gctcgctcag ccctgcgccc cgccgccgcc 1080 

gccgccgcca ccgccgccac cgccaccgcc gccacctcct agagccgcgc ggagccgagc 1140 

ggtgcgagcg cgccgctgtg cgcccccttg agttccagtg cgctccgccc gccttgcccg 1200 

cgcgctccag gcgagtgccg ggtcgcggcc cgcgcgctgc ttaagtagag ccgcctgctc 1260 

tggtgccgtg gcgcgcagga ggcgtgggcg gcggcggagc ctcgcttacc gggcgagggg 1320 

aggtggggag cggggactcg atttgcatct taaacgcgga cgccacgcgg ggctgctggc 1380 

tccaccccgg ctcgcgagcg ctgggggcgc ccggactccc acccgctccc cacccaccag 1440 

ccccgccccg tggtcccgcc caactttctc cctcttccga agaggacccg cccgagactt 1500 

ggcggcttcg gaagacagct ccttggaggg acagtcaccg cgccccctcc cctctcgccc 1560 

cagcccgcac ccaaggactc tggaaggagg acgtgaactt tcggaagcct gtcgccccgg 1620 

gagtggaaat gcgtggccgc ctccggccct cccgcctgcg gaactgggcg cttccagcag 1680 

tgcagctgcg gcctgccatc gttggttgtc gtgtcactga aagtagagag acgcgaaaaa 1740 

cctggagaga cgaaagagcc gccccagtcc gagcgagcca gattttgttc aagttgcctt 1800 

gcagccagcc tggaaaataa ataaataaac ctagaaacaa aaagcaaacc tcagtgtccc 1860 

gcagtaggat gctttacgat agtttatttt ctggctgcag agacaaaatg agatgaagct 1920 

tcaaatctgc tcttaattgc aattccctct tcccgcaggg ccttcgaggc acttcacttc 1980 

gaagtgaagg ggtaagactc cggaacgttc ggtctccggg tcccattcag ccttcgttga 2040 

atcgcgccct tagactcaca acagttgcag atctaacgta atttttaaag ccacctttgg 2100 

cgcctgcatg tttatccttc caaaagtcca ggcgcccctt ctctcccgca tgcctcggtc 2160 

actcttcggc cctgtccaag ttcccgtcag tctcagaaag ggatatttat atatctgaag 2220 

gcccttgtct ggatcccatc tctcaactcc tgctgcctac acgtttgctt ttccccctcg 2280 

gactgcagcc cacagaaccg ccccctcccc ctggaagctt tccctggctt cccaggtggg 2340 

tggggcctct ggatggctcc ctacacactg tctgcgctgc tcccagcacg ttctgaaatg 2400 

tcatatttat gagtgattat ctgatcactg atttgggtat gaaaataagt aagagctcct 2460 

gaatgctgca aaaagagaag agtgcagagg agggtgtcaa gaaatgattg ggtcactgaa 2520 

tgtttcttgt gtacctgcta ttatttgcag ggcaccaggg agtgaggaat gcaaaggcga 2580 

ataatctcca aagcgctttc agtgtagccc aataagcaaa gggtcagggg tctgcgcagt 2640 

tatcacatca gtttgaagta tagaccacac atcaggataa atcaggcaag atctttgttc 2700 

ttgataaaat ttatccttct gtgtaactac ccagtgagct tcacagtctc ctcacagata 2760 

aatgtcattt ttattttgaa atgtttatat ctgctcaaca attaccgcgc tcatgtcagt 2820 

gtgctgaaat gagatcaacc cccaaaccag ttatcttaat tgcaagtaca gattttgata 2880 

ctgaatgtat ttcaacattc tttttttgtt gttgttctgt tctttgggat aaataataga 2940 

taacacaaaa cttccaagtc ccagtgcagg atttcgaggt ggttgtgcta ctaggcgtta 3000 

tattagtgtc agagacctga caaaaccagc ctcttttcta caaatccaa 3049 

<210> 2 
<211> 2407 
<212> DNA 

<213> Homo Sapiens 



<400> 2 



taaggtctgg gtattctcag gcagcaggga caaggtgggc ttttttcctg gttgctaaac 60 

ccacgtcaaa gtcgagctca gggactggag ctcaagaaac ccaccgccca ttctccagtc 120 

cgaccgggga cctgcatgca cctctgccgt gctgccctga gtcctccaat cctccacact 180 

cttcctctgt tatgtacacg tctccaccca ggcctgcaaa agtcccagct tcctccaggg 240 

gcagggaccc gcacgccggc ccagggcttg gcacgcgggg atgctgaaac agggccaggc 300 

ctggtttcca gccgatcgtc agagtcccaa ggcccagcaa ccttcctcac aaaggcctcg 360 

ttaagaggcg aggaaacaag agccgggaga ggggcgcgga acggcgggcg ggacgaacga 420 

ccagctccgc gcctccggcc agctgcgtcg agccaggggc accgcggctg ttgtgcggct 480 

ggaaatctag gaatgggaag gttcggggcc tgctcggctc cggaggcagc tggcgggtcg 540 

tccctggcgg cgttggagcg gtcagtggca gccgggcacg ggcgaccggg tcgcccgggt 600 

cgccctcaga ccgtgactcc cgaaaaacct tgcgggcggg gcgcgcccgc gccgtctctt 660 

gccggaaggt gcgagttagt gcgctcgatt gtgggcgggg gcggaaagag gcgcgtttta 720 

aagtggtaac agatggtttt cttatccaat aggattaaaa aatttgtcct tacccggccg 780 

accgcggaag tagagtaggc gggcggccaa tggggacatg atggggggcg gagccgaggc 840 

ctccgaagcg gaagtgggtt gctgttgagg cggcggcatc tttctcgagg agctctcctg 900 

ggcggctgaa gaaggagctt cttctccgga gtgcgccggc ggtggcgcct gcggacctaa 960 

ctagctccag gttaggccga gctttgcggg aaagcagcgg taagtcaggg ccttgcagat 1020 

gcgaggttta ggcagcttcg cggcctacag aggcctcggc ccgcgcctct tgggggagcc 1080 

gcgctgcgcg gcttgaccca gccgaggctt tgcagcccgg gacctcgagc cagctctggt 1140 

cgctcgcact gccgtccgcg cgggcgcacc gagcccggct tggcgcgggc aacagaagtt 1200 

aggaggtctg cgtctgggtc tcggctcacc ctggggggcc gcggccatgg ggcttagttc 1260 

ctagcctagg aagggaaact gagactctgg gaggggcagg aacgccccca aggtcacttg 1320 

gaaagtcggg caggatgtgc tgttaggggg aagacccggg cagggttttt gttccccgct 1380 

gacgacgcct cctttgtgtg ttcgcgccgc cgccccgcca tcgtggggcc tgcgagtttg 1440 

ccggggtgcg tgggccgcgt ggcggggcct tttgtaggtc gggaggatct gagtacgggt 1500 

gcgggcctga ccgtgggggc gccgaggtcg cagtctaaaa cttagtaggg cctcgatttc 1560 

cgggcgcgct tccgggcccc ggctggtggt tggtggaacg tgcgactgtg aggcttgcgg 1620 

cccagccctg caccgctcgg gcccttcacc gctctggcgc gcctatagac aggtgtatga 1680 

agattctcac gacccgaaac agagttgcta gtaaacaccg cttttccgcc tttgatccat 1740 

cggggaagag ggaaaaggat agagcttggg caagccgttt tggtagggat ttcagctttt 1800 

gtctttcact tgtcagttcc catagacgtt cacaaactta ataatcttcg ttctgtttct 1860 

gcaccaagtt cttgagccag acgtagggtc tcagctctgg agcctggctt agactgtcca 1920 

actgactggg gagactgagg tccagaaaag tgaagtggtc tgcccaaggt cacatagcca 1980 

gctatttggc agcagatgag gttaagtcct acctgcaaga tttgggtttt gaattcattg 2040 

accaggagtt ttgggaccac tgtcaataaa agagacattg aagggaatct tttgttactt 2100 

tcttggtgat ttgcttttta atggacaagg acatattggg ttcagtttta tctgtgagtt 2160 

tgaggtgaaa tagaggcatt cgagtagcaa gatatattgc tggcttttgt attgcctgaa 2220 

tttgagcttc caaaaatctt actttaacac atcgtttatt gatcttttct tgaattacta 2280 

cctttgtaag gaccttttgt aaacattgtt tttctaatct tcatgaaatc ttaatgccat 2340 

acgtaaacta tttcttttta tataatgtat gcacatctgt gctttgtaca taaaatgagt 2400 

aagattt 2407 

<210> 3 

<211> 3952 

<212> DNA 

<213> Homo Sapiens 

<400> 3 

ggccgcagcc ggagaggaac aggaaccgca gtggggacgc cctggtcccc gggcccacag 60 

catccgggac ggacgcgcag tagcgcgggc cgggaactgg gtaccagggc gggatgggtg 120 

agaggctcta agggacaagg cagggagaag cgcagcgggg tgcggggaac cgcacgccct 180 

ccctttgcct ctgcttccca ccccgaggcg gcagggcggg cgggcgcggt tccgggggtg 240 

ggcgggctgg gcggggcgga ggcggggccg cagcactggc ttcacccagc ctctcccgcc 300 

cgcagccaga gcgagccgag ccgcggccag ctccggcggg cagggggggc gctggagcgc 360 

agcgcagcgc agccccatca gtccgcaaag cggaccgagc tggaagtcga gcgctgccgc 420 

gggaggcggg cgatgggggc aggtgccacc ggccgcgcca tggacgggcc gcgcctgctg 480 

ctgttgctgc ttctgggggt gagtgttagc cggagggggc ccgctccctt tcccgggatc 540 

agaactccga gaagagccgg gcgccgccac caaggaaaca gaacagagca ttggggtccc 600 

agatactgag ggtgggtggt gggaaaggac ctctgatgcc gggaccacga aggagggtct 660 

agggttcccg gagcgcagag gcgactctcc agggtggaga tgagggcaag accggagcac 720 

ggatgccggt cctcaggtac cgcagggggc ggtgggggag ctgggagggg tctttcaaga 780 

gorgggcatgg ggctctccga tgcccaggtt cttcggaaga ggacactcga atgccgggat 840 

cccgaaggga ctttcccctc agcatctcgg tctctggaga gtcgtgggac agaaccaagg 900 

cgagagaagg agggggaact ggacggggac tagagatgag aggggcgagc tgggctgggg 960 

cggggagccc gggacgacgg gatggaacaa tggaggaggt cggagggact ggaggcagag 1020 

gggacccggc aaggacaatg gggaccggcc agacaatggg ggggaggggc gtcggaggga 1080 

gaggttgggg agcgggagag aggttgagaa gtgggttagg gagggcagaa gaggggcgac 1140 

ggcgagggcg gcagagagat ggaagcagaa ggaaaggaga cagagcgggg cgcggaccct 1200 



gaagtggtcg gggtcccgcc tgcccgctga gggacagtga gacgagaaga gagagcagag 1260 

gcttggacat ttccagcgcc tgtccccgcg ctcctgctcg cggcctctcc cagtcccgga 1320 

attggggctg cggatctaag gctgggaggg gagtgcccac ttcgccgggg cgaacccgct 1380 

cccggggttc ccccacggcc ggcgggctcc tccgacgcgc gtgttctcac ggcttgaagc 1440 

agcgcagcgg atcgcgaggg acggagcggg tgtgccccta cccgggctgg cggtggctac 1500 

gtctcggggt gcagagcgcc ggggcgcact aggcggggac ccttcctggc cgcgcgcagc 1560 

tctggctacg caggcgggag gcggcaggca gggggcgctg cggggcgagg caagtgcagt 1620 

ccgccgctgc ctgcctgcga gtctggggct gcgcgccgct ccccacctag ctgcgcgacc 1680 

cgagcgccgc gcccaccgcg ctccgtgggc tcccgcgcct cgaagggtcg ggatccggtc 1740 

tgagcctctc ctggctctcg gtgcaagccg ctgggcacct aacgagacgg gggcgcccgg 1800 

acgcctcgga ttctgtagcg cggtatcccg ggccaccgac tcggcaaagg tcggcgggtc 1860 

ctccgggtgg tccgacgaga ttgtcagatg tggcctgctt tatggccatg aaccgaatgc 1920 

cctggaaggt ctggcaacgg aggaggcgtc ttcactctcc ccgccagcat ctttcatcag 1980 

agcactgcgc cccaaattcc ggctttacag atacggagac tgaggctctg agagtttaaa 2040 

cgacttggcc aagatctcac agcaagttgg gccagagtcg tggctagact ccccgacctg 2100 

attcccagct cggtactccc ccaccttttc agggggcagc cgctgtccca gctcggggtc 2160 

acggctgcgt ttcccagcag ggcggggggg gttgacactg tccccagagt ccgtcggcgg 2220 

tagagcctgc gggtgaggtg ggcagagatt ctgtccgggt gcgggcccca gcccgaaagc 2280 

tcagccccag ttccctggag ctgtcaggac ctgtggctgc gtccggggat ttgggagcga 2340 

ccgcgggcac gtctgctgac taacgccgct ggttagagac gctgctcaga cgcgggtggg 2400 

cgagcgtaga ccaggagagc ggggaggagg gaccggatcc cagaggggca acgacttggg 2460 

ctccgggtct gggcaagagc gagggtcccc gcgcgaagac ccaaggaagg aggggagctg 2520 

ggcgtggagg acctgggcgc ttgggaaggt gtaccccgac aggaacctgc cgtctgcagc 2580 

ccctgcggct cccgcctccg cctcggcgcg cggctgctcg gtctgggatg gctgtgcgcg 2640 

gtccgagttt cttcctgggc cgcggtgaga gagagagggc tgaaaccaga gcgcgtcccg 2700 

gcgggtcggc tggcgggccg cgccggtaat ggaggcactt tgtcattcag acgtctgtaa 2760 

ccagagccgc cgggctggct aatgcgccta atagggatgg aacgagggca gcaaatgggc 2820 

gtgcgtgagc ggccgggctg agctgggtgg atggtggatg gggaggcagc tccgcggggg 2880 

acgggcccct accgcactgt tccagcccag tctggctcaa gcgcctcgct tcttccctgg 2940 

gggaccgcgt ggggaggcgg gaataatcga tgtgctgagt tcctactaag tgccttgcac 3000 

ttccctggta gtttctcttg aattatttaa tccttataag gtcctagcaa ggtgaattta 3060 

gttattccca cttcgcagtt gaggaaacag aaccttctag actttcaggc ccgagaccag 3120 

gctcccccag tgagtcagtg gtggcgtctt tgttggaacc caggatgctt ggcctgggat 3180 

gttctctttc ctacatatgc caggctccct cccgccccct accttctaga tcatggaagg 3240 

agtcgtggct ctgagagatt aaatcgtgcc ctatgccttt ggccccacac atttagatct 3300 

gagccccttc tccacacgca gaggccctgc cccctctcaa gtctctatgc tacattatct 3360 

tgggtaatta gctcctttcc tcccaactcc tcccctatcc aaagggagtg aggggtatgt 3420 

gccagtgcct cccctgggtt tgcatcgacc tcaaaaggcc ctccaggtgt ttgaaaatct 3480 

ctcccatccc atggcggtga ccacggcccc attcacttaa gagctctttg ctgaagctgt 3540 

tggaatggcc agggctgatt atactgaaac attcgtatgg tttcaaatac tttcaactgt 3600 

ttcgctgttt ctctcttttc tctaattaaa aaaaatgaaa attcttaaaa gtcaattcat 3660 

ctaaggatcc ttccacctta gttccttgta agccttgaaa tggccaaaac aaaaagatta 3720 

caagcgttct taaaaatgag atgaaatcat tctgtcaaat gtgaaatgac taagtgtaac 3780 

acatacaaat ttcagggaaa tttatctgtc atccctataa attaatcctt ttggatctga 3840 

ttttggagca attagcctga atgttagtca cccctcccta cttggctaaa ctctcccctc 3900 

ctgcctgggg tctccttaag atgagatgct tctttgataa ctaacattta tt 3952 

<210> 4 
<211> 2436 
<212> DNA 

<213> Homo Sapiens 
<400> 4 

agttctggat gacaaggact aattgcccag ggcgttggcc gagaggaata ggaagagcgc 60 

cccgtggagc agggcgggcg agaagagccg ggcggccgcg gggtcctcgc gcatgctggg 120 

caggaagagg aagagctgca gcacgaaggt caccagggag atgcacaaga agtaggcggg 180 

ggccaccacg ttgagcagcc gcagggccag catcctcgat tacattcctg cggggcccgg 240 

ggtgagaaga ggactgactg actacggcgt ccagggggcg ccctcgcccg ctcctccgtg 300 

ccgcgcgtct ggggcactgc acgcgactcc acgcccccaa ccccctgtcc cgcggatttc 360 

ccctccccag ttccctaacc ccggatcagg caggccagag acccgagccc gcagcggcct 420 

cgggtctagg ccccacctgg gggtgagtgt tgcctggggt tttgagcgag ctcggcgcct 480 

tggcgcggca gagtctggca caggagggag ggactcgttt gcagtcttta ggagttgtga 540 

gggtgggggg caccggggga taggactcct cccctctgtg gatgtggaaa caagcccagg 600 

tttgggattt ctggttcaga cctccctagg aggagaaagg gaaggatttg gggtcggggg 660 

caggagaatc agcggcactt gacctgctgg aaatcctgct tcacactccc ctttctccgt 720 

ccctgcgtcc ccacccacac acacatcttg ggcagcactc agggacctca ccaggttacc 780 

caaggctctt tggaggtatg tgtgcgtgag ggaccacgat gtgtgtgtgt gtgtccttgt 840 

ctgtgactct gtacgtgtaa tctgtgtgcg cttgagtatc tgtatttgtg tatgtatata 900 

tctgcgtgta cagacacgca gcctctgcta tctagcttta aaaaaatcaa ttgaaacaat 960 

cgatacactc acactcgctc tcgggcaccg ggtgaggtgg gggcgcaacc agaatacctg 1020 



tgtcagcggg tcctggccag cacatctctc caacgctggc tgcccgggct atttctttgc 1080 

gaagagcgtt ttcatttgag gcgaaattaa aatcccccct tcgcgcccgc ctcccgccct 1140 

cctgctgaga gaaacccaaa caaccctcat ggcgccgaaa tcctttccat gccgacaagc 1200 

ccgccctggg cgcacgagtg gatgcttggc cagcctttcc tgggatcgac cccgccgccg 1260 

tgctcagccc tcaccacgtc cccatcccac cccacgcctc acagccgggg ttcctggcca 1320 

gcttcggaag ccaccgagaa ataggattcc gtgcgcccga gagaactttt ccaggggcta 1380 

aggaatcggc cagccggagg cgcgagaaaa gttctgggaa ggcggttgca cctaggatgg 1440 

gtggatgcca cggggcctcc gtccaggctg tcccgtccgc acgggtcgac tggtcacctt 1500 

ggaatcccct ttgcaggtcc cagcgccccc cgggaacccg cagcctccgc ggagagcgtg 1560 

ggcctctccc taccgctggg gcgcagcgca gtgcacgcct gagggtggtc gccgggggct 1620 

gggcacgccc ccagtcctgc gccgccgggg gctgcggcgg tgctgcccac cccagagagc 1680 

cctcggcctg gggctccggc gaagcaagtg ccttcccggc gccggtcgcc aggggggcgc 1740 

gggagcagcc agatgcgccg cagcgctggg aaggcggcga aggacagggg ctaggggagt 1800 

gaggggcgct cggcaggcag cctcagccct ggccctgcgc gggagaaggg acagcagaga 1860 

ccgcccgtgg ggccccgggg tgtagggagc tgtccgttca gccctggcgc ccgcctcgcc 1920 

cgcggcagag ggcggcacag ccggagcctt ggaaagaccg gcagcgccgg cagccgcggg 1980 

cttctcggcc actgcctccc ggacgcacgg gaagccgccc cccgcgccgc cgccgccgca 2040 

ctgccgccgt cgcagagggg tgaggaaatc aactcaccga gctctggtcg ccgacaagag 2100 

gagccccgga cgccggctct cgccctgccc gaggctgcaa agttgtggac tcggcccggc 2160 

tggctcgcag cctgcgcttc gcttcgggaa ctgggcaagt agcggggatg tggggaggag 2220 

ggagcgggca gctgctggct tcccacttgg gcgccagcga gtaagggcca ggaagcggcg 2280 

gggatggcag ctgggcaccc cccagccccg ccacccctcc ctccgctcct gggggcggtg 2340 

ctctggccgc cagtctcagc atcgtggact tgccccctcc tcccctctcg attccccttg 2400 

agcgggctgg ggcgccctgc ctggtttcag cgcccg 2436 

<210> 5 

<211> 2470 

<212> DNA 

<213> Homo Sapiens 

<400> 5 

agatttactc aaatttaaga atgagaatac aaatccacat cttgaagtgt ttcacagaaa 60 

ggtctatctt aatgtctgga gtatatattt caatgaacat tcattttatt ttatttctct 120 

ccattcctga atcaagcaat cttgaatcta aagttgctat gattagcact gaaaagacca 180 

ctggactatt aattgtgtga ctttgggaca gtaactttct gcaccttagt ttgtttacat 240 

gttatacatg aaggttgaag tctgattctg ctctgtgact atcattctaa acatctgatg 300 

aaatcaaatt tcagtgtttg gaatggtagt acaataaatt tactaagaat aaataattca 360 

ctgcaaaaac acattgattt ccaaatgatg taactgacag ttatattact gcagagggct 420 

gataaataac aaaagaaatg aaagatgcac atggtgagaa ctgaaattat cctgacaagt 480 

cttctacctg tttatcactt aaaatcaatg accatgctga atgcctacaa attacaaaat 540 

ataaaagaaa tcttataaat gcgcatgtac aggagtctaa gttactaaaa gttttaaagc 600 

ataagtttaa accaaactaa tcaaagaagt tgagaggaaa aattggcttt catctttaat 660 

cactactgtt ttgaggtcct atgtttaata taattttcta agtagaggct tcagagagaa 720 

gagttgtgag gatactttca tatttgtgta gaaggaaaag tttgccatcc attctagtat 780 

ccctagtgtt atactgatgt gcaccttgga tttattttgt tcctattgta taaactcata 840 

cttgacttca aagaaaagga aaatccaaag tccctctttt ctaaggggac agaaatcctt 900 

tgtgtcaact gtttgaccct tttctctgta aggtcctatt ggaaatcttt tgtaacacaa 960 

tgcaggggac tcttccatgt gttgatgctg tttacacagt ggggtgggcc tgactgaaga 1020 

aaaaaaatcg catatacgca tgaaagatta tggtcttatt tccggaaagc atgaaaggtg 1080 

attgatactt ccaagaagtc cctgttactc aggaaaatta tcaaatattc tactcagaga 1140 

tacttggaaa gactgaagga aaggaagaac gaagaaagca gaatctagac ttatgtgggg 1200 

agagatttgt ggcagaggaa aagtattctc tttgaatccg acaagggatt tgcctggggg 1260 

aatttcctgt ccagcctttt attaccaggg tcttttgaag ccgggctccc cattgggcag 1320 

ttccctggga gtgcagtggg gaattcttac actttccctc taggtccccg aaggatctcg 1380 

ttttctcagt gtctctttca ggttggcagg agccttgagc ctgacacttc cctttgatgg 1440 

gacaggcaag ctctgtgggc gcgtaaacac gctgtaacca agttctttgc tgattttaca 1500 

gttttgtgtg ctcccgagaa gaagtgatcg tactcaattg tctattgctg gcctgccccc 1560 

taagagcctg ggggctcctt tcccctaacc cagaactagc tgcacggggg gcggggaaat 1620 

gggggtgggg aaggagtggg agggcagtgg tttccgcgag cagagcgatg ttactgagtg 1680 

agtccctgaa tggggagcgc tgctgtcccc aagccgattg gtacttcttg tcaggaagaa 1740 

acgccaagag gtgggagtgc ctggggaggg aggcaggcgg tccctaccgc aggcgcgggg 1800 

agctgccttt ccgcccctcc gcctgctttc caagcctgga ctcttaggag tggctgaagc 1860 

tgcggagcgc ttttggagcc tgtgaatgaa ccctcctcct ctccctcctc cttcttctcg 1920 

ctgagtctcc tcctcggctc tgacggtaca gtgatataat gatgatgggt gtcacaaccc 1980 

gcatttgaac ttgcaggcga gctgccccga gcctttctgg ggaagaactc caggcgtgcg 2040 

gacgcaacag ccgagaacat taggtgttgt ggacaggagc tgggaccaag atcttcggcc 2100 

agccccgcat cctcccgcat cttccagcac cgtcccgcac cctccgcatc cttccccggg 2160 

ccaccacgct tcctatgtga cccgcctggg caacgccgaa cccagtcgcg cagcgctgca 2220 

gtgaattttc cccccaaact gcaataagcc gccttccaag gtaatcacgt ttcttttgtt 2280 

ccccccttaa aaaacaaaaa caaaaaactt atagaaaaaa acccgcgagc ttagaaaaaa 2340 



gaagcaattg gtagaaggct ttaattaagg caaagagctg taaggcgaag ttaagaaaat 2400 

gtaggcactt aaaaaatgca ggtaactttc ataagggctt ttggggagag gcatacagag 2460 

ggaccttggt 2470 



<210> 6 
<211> 2470 
<212> DNA 

<213> Homo Sapiens 
<400> 6 

aaaattgaac atctgagaac actggaccca cattcctgag ggcaacaatc tgctagagtt 60 

gagcagctgt tccttttagt tatggcatgg atattccaat ttgccatttt ccctatcatt 120 

ccccattgtc ttacacaagg ccagtctcac tcatttgctt tacttgtctg gcctctgtag 180 

ccatttgaat ctgcaaacac tatctagacc tttcctgtct tccatcaaag cttactttgt 240 

tctgtcccaa gtccttcatc agtgtttcaa agcaagagcc aggtaatctg atcaaatata 300 

ggtcctttaa catgtgcttt ctggagacag catcttaaca gaaagagatg actctgcagg 360 

tagaaattac aggcaccact ggattacaat caggatggca actgtctgtt ctactttctc 420 

tgttatcttg acgtagccat acctcactgt tctaaagcag accaaaggaa ctggttctat 480 

ggtagaacaa tggacgctgg tcttatgtat gaaatcttct caagctgcac ttttatagat 540 

caccctagtt ccaaaagatc caaagctacc acgggctcct ctgacccccc catttcctcg 600 

gaggcttcag gatatccggc ctgacggcat ttcccctcac ctgctttcgg ggcccgtttt 660 

gcgctgggca gttgcgccca aggagcgcgc gcatccagca tgagctcatt tctcatgggc 720 

gtttccagag gcccggccgg gccgcgcaca ccaagcgcag caagcctctg tatcaatggc 780 

ccccgcggct gggcggggcg cggccgccca aaggcgctgc ctttctggaa gctttgttcc 840 

cattttagcg tctgagagct tgcagccggc tgggaaggcc cccttggtcc gtctggccct 900 

ttcggggaag aggccaacac tcggcacacg cgatccacgg cagaggggag ccttgggcgc 960 

gcagaattgg ctgcgccccg ccgagagcct cctgtgggtg gggagagccc ctccacccct 1020 

ctgctcgctt gagcgctcag agcccagggc cgccgaccgc agcactttcc gatttgctgc 1080 

accgagggcc cgcggtccct gtgtgcggtt tccaccgttg ttggaggcgg ccgcaggcga 1140 

accgtcgggt cgtcagccac gacccgagtc aggcatctcc ccgctcctgg gaccggggcc 1200 

gaaggccaat cacactgcag ctaggtcttg cgattggacg gcagtgagag ccgattggcc 1260 

gccgccgcga gtttcgggct ccctccctct cccttttgcc ccaactccag cgcctaagct 1320 

tcaggccaat gagacagcgc tttatagacg ccctcccttc gctttcttct ccccaccttg 1380 

gagagggagg ggaagtcctg actggccaga ctgtcctcgg aagccccctt ctcttcacca 1440 

atcaccgaag gaggtacgct cccagcggag ttttccagcc aatcacaaag cggcggcggc 1500 

gcccagccgt gcagtttcac cagcgtctct gggtttcacc gtcctcaact cttcaagcct 1560 

cttcgtaagg gccggcgact ctgattggcc actgttgcca ttgtcgaatg tctcctccag 1620 

ccagccaccg aacaaggcga attcaccctt tccgttcggc taccttcggc attttccgct 1680 

ctttgggcgt ggcttcccag cgtcactttc taattggttt ctcaggctga tcggcttttt 1740 

ccgggaggag ccgcaaacaa acgacgtccg tgattggctc cgttcgggct tcggctccca 1800 

gccgaagcgg gcgagcgtgg ggctcggccg gcgattccca gacgcctgtt acgcgggcgg 1860 

cggggcgctg ggcggtgtaa ggctgggtgg gggaggaagg aggtggagga cgagtaggag 1920 

gggggaggag gagtggggaa gtgcaaggcg gctgcgcaga cagcgctcct cacacagagc 1980 

agctcctgac ccgggcgaat gcgggcttgt gccgccgccg ccgccgccgc cgcccgggcc 2040 

aagtgacaaa ggaaggaagg aagcgaggag gagccggccc cgcagccgct gacagggctc 2100 

tgggctgggg caaagcgcgg acacttcctg agcgggcacc gagcagagcc gaggggcggg 2160 

agggcggccg agctgttgcc gcggacgggg gagggggccc cgagggacgg aagcggttgc 2220 

cgggttccca tgtccccggc gaatggggaa cagtcgagga gccgctgcct ggggtctgaa 2280 

gggagctgcc tccgccaccg ccatggccgc tggatccagc cgccgcctgc agctgctcct 2340 

ggcgcaatga ggagaggagc cgccgccacc gccaccgccc gcctctgact gactcgcgac 2400 

tccgccgccc tctagttcgc cgggcccctg ccgtcagccc gccggatccc gcggcttgcc 2460 

ggagctgcag 2470 

<210> 7 
<211> 4721 
<212> DNA 

<213> Homo Sapiens 
<400> 7 

acgtgaagcc cagcacctcc ccaagccttt gcctacacac aaaaagccac gttctagagg 60 

ctctatttct aacttaagat aagcaggata ttttaaaaat catctactca ccagttggtt 120 

tgagaaaatc catcgggtat ctggagtaag gagggggggg agactctgaa attaaaaaag 180 

caagagaaaa taaaggccca gccatgagaa gagagataga aacttatgat aacagaggca 240 

ttcttttagc ccaactgttt gtcttagctg tgggggttgg aggcagggcc cgaggcggtg 300 

attgccttga tatttagctg tcagataaac aaaagaggcc gcctggcgcc agcctcggcg 360 

ctccgctcct ccacgtctgc ggccgcagcc gccgcgctcg gctggcccgg ctggaaacca 420 

ccggctcggc cggagccgcg cacttcacag ttcacagggc ggggggcgac catgccccct 480 

ccccgtcctc ggactccaga cccagctccc agcaacctcg gctccccgcc ccggcccgcc 540 

ccccaggcgt gcaattcacc tctgcccccg ccggtccgct cggcactcgc gcgggggaca 600 



cggggctgcc cattcctagc gcactcggga gatgcatccc aagcgtgctg gggaacgtgg 660 

gggatctgcg ggcgcccggg gtgaaccctc acatcacact ctgtgtacag agacgtagag 720 

taaaggaagg aatggagaaa gctgagagga taaaaggaaa ggagtttcag ggtctcagcc 780 

gagaagccca ggccccagcg cagtaagagg cgacaacgtc actgaggcgc cggggcccca 840 

ctaatctaca gacttatctg tgcggaagga aggaaaaagt aggggacacc caacttctcc 900 

tgcctgtttg ttcctctgca ccctggggct ggcagagttg aaaggagaca gatacccagt 960 

tcggcctccc atccaactct ctttgcccca aaactccaaa ggtgcgcggc cacgacagta 1020 

tctgggccac tggtgttcga cgtgattctc cgtatttact aaaggagtaa atacggagac 1080 

cttccagcct ttatagcacg cctctcccag ccttccttgg cgggcctgga ccaaatccaa 1140 

ctcggccaca cacctcaacg cctttccggg tgcggggagg cgtgaccccc ccacctccct 1200 

ctgcaccccc ttgttgatcc ggaaatgaag gcgtctaatt tttaaccgca gactgcagga 1260 

gcagtgcccg tagacccctt cggggcaaca caggagaaaa cctggaaggg aacccccagg 1320 

gaatgccctt tgagtttccc cttgtccctt ctccactttc agcccccagc ctcttcccag 1380 

gagggtatgc acactctccc aggagggtat gcacactctc ccaggagggt atgcacactc 1440 

cccctggagg gatggctgca caaacgcact gccccagggg cttttcttcc agcactggcg 1500 

ctccggaaaa tgttgggggt agggggacaa cttctcacct agttcgcaga gtcggctaag 1560 

gtgatggggg ttgcagcaca ccagctcggg gttgatcttc ccgtaagatt cacagcaaca 1620 

cagcctcttg actcccgagg aatgcctgag atccggccac ctgaacactt tgcacagcag 1680 

gagggggagc gagtaggacg agggcggctg cgcaggctgc gcgccggcgg gcgcccccgg 1740 

gcccagcctg cagtccaggc ggccgggcag caggaggcac gcggtgcgcg tcccgccgcg 1800 

ggactccacg gcctggagca gcagctccag ctgccgctcc ttcagtttct tgagcaccga 1860 

gtgcgtgagc gccttcagat ccgcctcggc gcccccggcc gcgccggcgc ccgcggctgg 1920 

cgggtgggga tggtggtgac ctttggcacc tcgcaccgcc ttgcccaggc agcatccagc 1980 

cctgcccggg ccgccgccac cggccccatg cgctcggctg tccgtcgccc cttctccccg 2040 

cagctcgcct cctcctccac ctccccctgc gccctcctcc tcgtcctcgc cgccgggcgc 2100 

acggctcctc cagagacgcc ggacgagcgc agatcgtttg gtcctgaaca tgcggggcga 2160 

ggaggcgagg agaaaagtcg tttgcctgct aaggagcgaa catgacctcc gcacaccatg 2220 

aagaagtcgg gcgccgagtt ggggcagcag gcgcaggcga cagcagcagc agcaggggcc 2280 

cgggcaggag cggcggcggc ccgaggggcg ctccgtggca tgcgccagtc tcccggaggc 2340 

cggggcgcgc gcgggggccc gggggcgccc gccggggatc gggggcctgc gctccggctg 2400 

ccccaccccg cgcggcccgc gccctgcgcg gctctccggc cccggcgcgc cccggaggaa 2460 

cccggccgcc gcttccctgg ggacggccga gcctgccccc gtcggcgcct ccccaaaaag 2520 

aggccccccc gcagtggctc ccgaatgtcg ggctcgccag cctcggcttc ctacatggaa 2580 

ggtccgcggg ggcaaaaaac gaaaggcgtt cggctgggct gttggaagaa ggaaaaagcc 2640 

tctttccccc ttgctaagca acttaacttg ggggtgggga gaagcaggca attaaaaaaa 2700 

aaaaagcaag cgatttattt ttttcctcta tatccttagt aaccggatct cctcgaattc 2760 

cgcgcacacg aagactcagg ggagggggcc gagtggactt caccccgcat gagacgtctg 2820 

gcaaaataag aaggctctcg caaaacctaa caaccaaata tgcaaagccc caaatgaaaa 2880 

ccaccacctc ctcgaacctc agaggtctgg gggcgtccgg ctggaactgg ggtttaaaaa 2940 

aagaaaatgt ttacaaagta taacaagatg tttgatgggt ggaaaaatgt atccacgagt 3000 

tacatccccc cgtttccttg caaagccccg ctggtcttcc tctccttttc ttctgccaaa 3060 

aaaaaaaaaa aaaatcgtgt atttttctaa tccacagaaa gctttggcta gaccgcttca 3120 

atcctgcgca tctgggtggt ttaggggagt ctctggtctt tccccctgcg ctcctggggg 3180 

cccaggtcct cggcggggac ttcctcgagg ctggcgcggg cgcaggggca gaagatgctg 3240 

cggcggcggc tgagcccggc ggggctgaca gcgcggggga gggtggcgcg gcggcggcgg 3300 

agggccccag acgggtcgcg cgttctcgcc ccccccgggc acaagctgct tgctagtgca 3360 

ggggccgccg atgtcccttc ccctggccgc ggctggccgc cgaggctccc cgcatgggct 3420 

gctcgcctcg acccagctgc ggcggcagga ggccccggtg tcctctcggc gcctcctcct 3480 

ccgagactct cctcgtcgcg cccgggagcc tccttgtccc cggtccgccc tctcctggcg 3540 

ctccggtcct tctgccgccg ccaggggctc gccgcgccgc actcaggggc tcaggggccg 3600 

ggcgctcggc ggctcggcgc cgacggactg gctctgtctc gggcagctct ctcccgcgcg 3660 

gcgagcggac cgagcacggc gcccggctgg ctcggctggc gcggctcggg gacaggatct 3720 

tccgtgcgcc gagcagcaag cgagtgtgcc cggggctcac cgcctcccgc aaggcctccc 3780 

gcccccgccc ccttccctcc cccttcctcc cccttccctc ccccgccccc agccgccgca 3840 

gccgcgccgc ctcctccccg cccccccgca ccccccctcc ggccctctgc tcggctggtt 3900 

ccactgcgca gtggcgcgcc cggctccggc ctcgtcacgt ggccgtctag acaccctgtc 3960 

gctttaaaaa aaaaaaaaaa gcgattgtgt ttcgcaaaca acagatcggg tttctaaaag 4020 

ctatttctcc ccccaacccc ccgccaccgc caccccctcc cgggtctgta gagggggtac 4080 

cgatggaggg gagagagata ggtggggggc agagaagctc ccagaatgga ttgagccccg 4140 

gccggagcca tggagaaatt ggaaaagcag ggagcaccga gcgggcttcg ccgcgagttt 4200 

tggagctgag cgagcgggtc ggtggcccga tttcgacccg gctgggtttc gcggtggcca 4260 

tctcgcgcgc gctcgcccta gcgcttcatt cattggtttt gttttaaagg ccctggcggt 4320 

ggatcccctg gccgcgcccg agggcaaggg gaggagagcg ctgtctcggt ttaaaagaca 4380 

tttataccgg actggaccga ggccctggga agtgtgcgct gaggggaaca gccgccgagg 4440 

gcggggaggc ggcgtgaata tgacctcagc ggcggccgcg cgctccctcc cgccctctca 4500 

gctccgggct ccggtttcta ggactgcctg gagaagtgtg tcttgtgcac agctctggaa 4560 

tgcatttggc cggctgacga gctgtgaggg gcagcatccc ggcgggagaa ggggagcggg 4620 

ggtgggggct cgcctgcgcg ccgcgggcag gtttcctccc gggcccggaa gacctccgcc 4680 

acccgccacc ctgcctcccg gcgcgggaag gttacccagc g 4721 



<210> 8 
<211> 2494 
<212> DNA 

<213> Homo Sapiens 
<400> 8 

tttctatgaa gttacaaagc tgcaatcccg ttctcatgtg cagcaccaca ggttttgctt 60 

tcaaatatct gcaggtgcat cctcctgcac tcgttaaact aagggttttc cttttctgtg 120 

ctagacccgc ttttacgacc acgctctcca cctgattcac agaaagggtt ctacgaccgt 180 

caggtcaccg ccgcctttgt actttattgg ggagtcgaag gcctcagcgc tgctcgcgat 240 

ttccctaagg tggtcgggaa ggtcccaacc acgttcttct gtaaaccaga taaggaaggc 300 

ctggggattc cggccgcgga aaggcacaga ggaggtgaac gctccggatg cccaagtgac 360 

gcacttgtag gcggattcgg aaactcgtgc tcttgagccg gaggcggagc ccccagactc 420 

agaccgaccc cccagaaccg ggcggccttg cccgaggcat gaaggctttc taggcccgct 480 

tctaaaagct gtgtacaatg gagactcctg gaataggcac atatctctac tgtaatcaga 540 

acgagatgta acatctaaga cgcagggcag ggctctggca aagacacgct gggggaacgt 600 

gaaagaaagg gccaaaacta cgcagctggt cctcagagac caaagactac gacccccagc 660 

attcaatgca gtgaacccct aggacgttgc atgccgggag ctgtagtttc tcaccaccgc 720 

cacccccccg cccccccgcc atctgaaagg gttctagggg atttgcaacc tctctcgtgt 780 

gtttcttctt tccgagaagc gccgccacac gagaaagctg gccgcgaaag tcgtgctgga 840 

atcacttcca acgaaacccc aggcatagat gggaaagggt gaagaacacg ttgccatggc 900 

taccgtttcc ccggtcacgg aataaacgct ctctaggatc cggaagtagt tccgccgcga 960 

cctctctaaa aggatggatg tgttctctgc ttacattcat tggacgtttt cccttagagg 1020 

ccaaggccgc ccaggcaaag gggcggtccc acgtgtgagg ggcccgcgga gccatttgat 1080 

tggagaaaag ctgcaaaccc tgaccaatcg gaaggagcca cgcttcgggc atcggtcacc 1140 

gcacctggac agctccgatt ggtggacttc cgcccccctc acgaatcctc attgggtgcc 1200 

gtgggtgcgt ggtgcggcgc gattggtggg ttcatgtttc ccgtcccccg cccgcgagaa 1260 

gtgggggtga aaagcggccc gacctgcttg gggtgtagtg ggcggaccgc gcggctggag 1320 

gtgtgaggat ccgaacccag gggtggaggc ggctcctgcg atcgaagggg acttgagact 1380 

caccggccgc acgccatgag ggccctgtgg gtgctgggcc tctgctgcgt cctgctgacc 1440 

ttcggtgagt gattctggag gagcagacgt cccccctcca cacacgcggc cgcttctcga 1500 

aggtcctggg ggcgttgaac gtgggagggg ggatcccggg gcctgcggtg ggccaaggga 1560 

cgtcaccatt cctcgaaaga gggcaaggga attacgttta ttctcttctt cctctggaaa 1620 

taaaaaaaga gagcaacatc atctaactgc cctacagatc taatagtatc tcgcccggag 1680 

gtctcagcga cctcggggtg gggcctctct gaggctctgg ctcccccggg agctgtgcga 1740 

gcccctcccc cctccccgcc cggaggactt tttcggccac cgcaaccttc ggccatccca 1800 

gctttcccgt cccctaattc cccagactcc gactcctcgc aagcccgtgt cccttcttag 1860 

atgctggcct cgggagcgtg aggcctgggg ccagcggaat ggcagtgtca gcgtgtttga 1920 

ttcagttttg ctggagttcg gttcttaaca tttctccccc ttctcatgcc tctgggctca 1980 

atttcagttc ccctcgccag aggttgcata aatcctattt gcattctgac ctgtgctaac 2040 

gaaggccgaa gttttggctt gatcagttgg gaggccttgg ctcggcctgc gcgcaacacg 2100 

tgtgggtgta gaagaggcct ggtatcagtc tgcttaacgt tatttataaa tgatcggaga 2160 

cacactgttt agaagtggtg actgcctccg tatttcagga tctactggag acctcagcat 2220 

ctgaacttaa atagatgttc tgtcttggct ttacctttgg tatttaaacc agggaatgca 2280 

ctctttcatc cccacctccc ttagttacca actgaatttt gcaaccccta cccttcccta 2340 

tttgatcatc ctccttggag aagtgtgatg ttgaagagtt tgcccgtgtt aaatcttcag 2400 

ggtcggtcag agctgacgat gaagttgatg tggatggtac agtagaagag gatctgggta 2460 

aaagtagaga aggatcaagg acggatgatg aagt 2494 

<210> 9 
<211> 2497 
<212> DNA 

<213> Homo Sapiens 
<400> 9 

tgaagactgt gtggagtggg aggtctgtgc ctatttttct cattcattgg ggaatgggct 60 

gagtggctgg gggcaaagtg ggactcaggg gctctgccca gtccttactg caggctaagc 120 

cctcccttgc ggccaagagc cactgggccc tccacctccc tctggcttct acccgcagat 180 

ctagatggtt tcatggtgac tcagaggagg atgcactcgg gcgtcccttt tgaggatggc 240 

tgcagggacc atccagggac tgagccgagg tgcagataca gccacggatt tgggtgttgt 300 

tagacacaga agggtgtgta gcaaagtgag ccccacgcat gcgtagactc ttccctcggc 360 

ttctgctctc catggaaaca caactgatca catcatgatt gatcacggta gccctctagg 420 

gctttctttc ttttcttttt ttttttttcc agaagtcata aagggatata gaatatactc 480 

aaagaccacc tcatgctttt agtggagagg caaaaaaggt tcgctccgga atggggaagc 540 

ggctgcgccc tggacggaga ggggcgggga cttcgcgact gcaggcggag ggagggcggg 600 

tgtcgctggc gcaggcggtg acagggagac accgccgcca ctgagtattc ctatgcaagt 660 

ttcttcatct tcctgtgcat cagtgtttac actggggtaa tgataaatgc tgtgttgaaa 720 

aattatttga tggggccatg gaaggaacgg aaggaacggc gtcctggccc gctcggggcc 780 

cgcgcacgcc gccaccaagc cgcgggggcg ggtcggaggg gagagttgcg tcagccaggc 840 

cgctgtcaga tgacgagccc ggggcgtgac ggggtggagc atccccaaaa aagtgcatgc 900 



ctaggatccc gcccagtgta tccctgcgcg cggcgggccg ggctgggcag ctttataaac 960 

agccgtggtg tgagcctcga agggaaccat cagcgcctcc tgtccacgga gctccaggtc 1020 

tacaatggca gcggccgcca gccccgcgtt ccttctgtgc ctcccgcttc tgcacctgct 1080 

gtctggctgg tcccgggcag gatgggtcgg tgagttcggg gatgtagcct aagcagggcg 1140 

ggggccaaac ctgggaggtt gtggactgca gcgggtttca gaggagggga ggcttctgga 1200 

aggaccggcg cgatctccct gaacgaacat cgcggtctcc ccgaacgtcg cggtccctcc 1260 

gaacgtcgcg gtctccccga acatcgcggt gcccccgaac atcgctgtct ccccgaacat 1320 

cgcgatctcc ccgaacatcg tgatctcccc agacatgccc agctgaaggc actcagttcc 1380 

cctcggtggc tcctttccgc cgggtccgct tcctgcggct gctgcttgcc cctcaggcca 1440 

ggaggtttct ggaaggaccg gtgctgtctc cccgaacatc gtggtctccc cgaacatcgc 1500 

ggcctctccg aacatcgccc tctctccgag caacgcgatc tccccgaaca tcgcggtctc 1560 

cccgaaaatc gcgatctccc cgaacattgc catctcaccg aacatcgcga tctcgccgaa 1620 

catgcccggc tgaaggcact cagttcccct ccgcggctcc tttccgccgg gtctgattcc 1680 

tgcggctgct gcttgccccg caggccagga ggcttctggt agcaccggcg cgatgccccc 1740 

gaacatcgcg ttctacccca acatcgcgat ccctccgaac atcgtgatcc cccccgaaca 1800 

tcgccgtccc cccgagtaac gcggtctccc cgaacatcgc ggtccccccg aacatcgcgg 1860 

tacccccgaa catcgccgtc tccccgtaca ttgcgatccc ccgaaacatt gcgatctccc 1920 

cgaacatcgc gatctcgccg aacatgcccg gctgaaggca ctcagttccc ctccgcggct 1980 

cctttcctcc gggtccgctt cctgcggctg ctgcttgccc cataggccag gaggcttctg 2040 

ggtggaccag cgcgatctcc ccgaatatcg cggtctaccc gaacatcgcg gcctccccga 2100 

acatcgcggt ctccccgaac atcgcgatcc cccagaacat cgcggcctcc ccgaacatcg 2160 

cggtctcccc gaacatcgcg atcccccaga acatcgcggt ctacccgaac atcgcggcct 2220 

ccccgaacat cgcggtctcc ccgaacatcg cgatccccca gaacatcgcg gtctccccga 2280 

acatcgctgt ctccccgaac gtgcctggct gaaggcactc agttcccctc cggggctcct 2340 

ttccgccgag tccgcttcct gcagctgctg ctagcaccgc agtccagggg gagtgtcaaa 2400 

gaaggctgaa aaggaattgc aggagggtgg agggaccaaa aggctacaga gggcaaggta 2460 

gggcggggat ccctggtgca gacccgcagc cccactg 2497 



<210> 10 

<211> 3095 

<212> DNA 

<213> Homo Sapiens 

<400> 10 



ggtcactaca tccggcagcg gggtggcccc tagctcctgc tgcccccccg ccctttctcc 60 

ccgcccgccc ccggagctca gccgatttct gaggctccaa ctctacccac tccctccccg 120 

ggccgccgcc gccgcgcctt cccccattct tactccctcg aggagagcca caggttgcaa 180 

atccaaccaa cctcgcaatc tatttttgca aaatcactca caaagatctc cctttcgcgc 240 

ccgcgcccgc tcctcccgcg ccgggtcccc tcagccacgg ccacaaagtg cccttctctc 300 

ctcctgagtc ttgcacataa ggaacgcggg ctggggctct gttcgtcttt ctcctcgccc 360 

aaggtaagga cctcgggaat ctgaagcctg gcgtccacta cgctcaggcc cgcagttccc 420 

tttttacaga gcttgcacca tgggaaaaaa taaaataaaa tttaggaaag ggaggcaaca 480 

gccattggga gccaacacag agtcacgcag cgcccaaaat acaaacaccg cagcggccag 540 

aaatcccgcc acctttctcg ttctcccagg ctgtcctgtc gaggttccct gagtcccccc 600 

gcacactgaa aggcatcgca ggtgcagtgc gcaccccttt cccacccacc ccaagaagcc 660 

ctgtcccgcc atcagtctct ctcctcggga tgagcaggga gagcgcgcgg aggttcccga 720 

ctccctcgac tacaaccaag aaagaataat tttcaaagtg ttcaacatcc ccgcccccaa 780 

gctccccaaa acacaggggc agggaacacc aaaacactcg gctctcatta ggaagatcac 840 

ggctctgaaa ggaaatagta gacacgatac ttcatctcat ctggatttat gaccaaaaaa 900 

acaaaaacaa aaacccaaag agttcgcttg cattttttcc ttccaaatct cggttcggct 960 

cgaaggcagg gaatctaaaa gaccgaggcc gatggaagag agccagcggg gcgagcgagc 1020 

gggcagcctc cctttttgcc tcccggagtt acccagaagg acaggggaag ggaaggaaga 1080 

agaggcgagg aaaaagagga gggagggaag cggaggccag gagcgacgga gcaaggaaag 1140 

cagtttgcaa gcgagaaaag agggaaaaaa cacagccgca cgaatccaga gagatcacaa 1200 

gccgtacgca agcagcagca gaaagagcga gagcgcgagc gcgcgtcctc tccgcggtct 1260 

ggggccagac agcccccaga ctagcccgaa tcacccccca agcactgtct cgtcctctct 1320 

gctccggccg ccccctaatt cccctccttc ctctcctcca cctcctttcc aaaaaccaaa 1380 

acaacacaag ggagggtggc aaaagcctcc ccaaaccggc cgattcactc aaagacaaca 1440 

ataataataa taaatacata acaatctata tcctatggtg ggagagacgt gggactaatc 1500 

ttcggcattt attttaacac ctgacagcta gaataaataa atatatacat ttatatcaat 1560 

agatacacat agaaaacttg gagccaaagc atttggcaag agcggaaaaa aaaagaatta 1620 

aaaggtaaaa taatgatcat gagcagcggc ggcggcagcg gcaccagcgg caacagcggc 1680 

ggcggcggca gtagcagcag cagcggcggc agcaacagca ataatcacct ggtgtccggc 1740 

ctttcctaga aacttcttgc atcaccactt ctaagaaccc cagttctaag aatcaacaga 1800 

gctcaattct cggaatttga gcttcggact ttaccactgc tacgtggcag gggaggactt 1860 

ggtgtcagct ctccgagatt tttactgccc ctggccaacc aaaagccctc aaagccacaa 1920 

gattttttca ctggccggca tatttcgagg tcctcataag cagagcgtct cggatttgga 1980 

ggttccggtt cgaggctcga ggggcctgaa ggtggctctc cctccccggg cccaagacga 2040 

tggtatggcc tgctccgcca ccatcacgtg ggctcctcct ctgtgacgtc ggcgccttcg 2100 

ctgtagcaaa gctcggcctc tggaattctg agaactaatt tgctattcgg tgacataaga 2160 



gggggagtgc gctttgcttt cccggggtct ggggctaatt ccttctttct tacccataaa 2220 

ctcagcagat cgagctaaat gcacaaaagg gagcgagagg tttgaaccac tgggaaaagt 2280 

atgttatata tatagtaggg ttagagaggc gagtaagaga aaaataaaat aaaataaaca 2340 

tcacagctct ttccaactag aatattaggc accacgagaa aaatatttgc caagcagttt 2400 

tcggtgggtt catttgcttt atttttattt aggacagggg tttttgctgt tgttctgggt 2460 

ttttttcttt ctggtgtggt ggcttgggat ttttggtttc tgtattttga tggtttatgg 2520 

atttttgctt ctgatttttt gccttttgca agtttgtggt gttacgtaaa tcacaggatc 2580 

ggcatcggtt ggattttttt gtacgtgcct tttctttccc tatctaatcc ctcaagcgtt 2640 

ttaaagatgt attatttcaa tactaatact attgaaagaa gcttaaattt ttggccatat 2700 

gtaacaatcc cagcccccac tttttttttt tttttttcct ttggtgcaat tttctttttc 2760 

ccccttggac ttttgctgaa gtgtgtctct cctgcacttc agagaaatgt tcaaaggatt 2820

tgttttggtt tggtttgttt ctttccagga cagcaagtgg tgggtttaat ctgttattgt 2880

tgactcttgg gaaatttctt gttgcaagaa acgtgtgtgt gggggggagg gtgggggtgg 2940

cggggtggta tgtgtgtgtt ttctacaaaa ttctgtgagc caaatacctg tttgtgtttt 3000

gttttctctt aaggtcttga gatttttgtt ttcgaggctc gtttcaaggt cgttgtaaaa 3060

aaatctcttc agtctgtgtt taagagatca gccgg 3095

<210> 11 
<211> 2229 
<212> DNA 

<213> Homo Sapiens 
<400> 11 

tctttcctcg gcgctggctg gtgcgggttg gggtcaggtg gagaagccgc tctttgttaa 60 

ggtgacagaa cgtgctgggg gtgggggccg gggccagggc cggtgcaact agggggccgc 120 

tgccctttcc tggacacagt ggaagcttct tccgcatcac caaatttttg tcatcctttc 180 

tgagggacct gcttccaggc agcacgcaag ttgttgtccc gggtttactc cgcacccctc 240 

tactgggtga ggaaggagca tcttgaatgg agatgggggt gtccccggtt tatacatctg 300 

cagagaagag gtgtgccggg ctgcacctct ggaggccgcg gtaactgata ttagagaaga 360 

ccccggttgc agctgggaag gctcactggc tggaaagagg tgcctcctcc ttccagcaaa 420 

gggccctgtt tggaagggct gcttctcacc tgtctagtgg caccacagga cggtcggctt 480 

ccactcgaat tcccccggac ggtatcatca catagccggg tcctcgcagt gttggtttcc 540 

caatccgatg actgtcacct cggtgaggac ctgtgctgat ggccggagaa ccctgcgctg 600 

cgggcgcaca tggccaggtg gcgcctggca ggcgacgtcc gggtgcagga cggcgctctt 660 

accgccccac cccaaaccgt tgcctgggcc taggtccttc ggcttcctga acaggggttt 720 

ggggggctaa ggacgctgag gctccggggg caggaagttc tctctggtta agcgttctct 780

cttctctccg gcatacactc ccctacccac ccacctcgcc taccctcggg gcgagaggct 840 

caccaaggca gggcgcgccc cccccatgaa tcatcccaag gcctctgagc cgcgggggct 900 

ccgggcaact atccccctcc tctcctggcc tcaggcaccc cagtccaggg gtctgcagag 960 

aagcccgaag cccggacaaa cgcgccggac gtcaacaacc tctcatccct ggcagcagca 1020 

aaggccaata tatttccatt tcttatttca gtttgccacc aaaacaaagc tgcgcgcggc 1080 

tgagggcagg aaggcgctga gaccgagaag aagggacgtc ccggagaaag tgcgcccagc 1140 

tgatcttaga aaccagagtc ctccgggact tcgccgagat tttctgtagg gcgttttaat 1200 

ctgttttcct actgcgtgcc ggcgtcgcag cgcgtgcggc tcagggcttg gtgactccgg 1260 

cttagcccgg cggtcgcggc gaggttcctg gcgcagccgc ttggaacttc gcattagaat 1320 

cgggaccgcg caaatgccct ggctgaagtg tcaccctatt caagaaacac tgctgtcagg 1380 

aacaaaatgg ggtccccggt gctccgaagt atcttctgaa attttcttaa aacaacttac 1440 

aaaaaatgtt tttgctttaa cgttttacaa cgtttaagga aacatgtaaa tggtctgttt 1500 

ctttatcgag atggtcgtcc taactaacag tgtacacata cataacaatt cttccaactt 1560 

tcctcctcag agctaagcac ttcactatat gtaaattata ataaagaaaa gattgtgcaa 1620 

gatcatgcaa gtcgattgac ttaaaatatt gagttttaat ccaggccctc tgtttttcta 1680 

tttaacaact tttgtgtttg gaccagactg gtgaagcagg ctatggaaat taacaaagta 1740

aaaaattaaa agcatcttcc ttcgccatcc ctccctccaa aattaaacaa cagtcgcccc 1800 

ttcctgagca ggcttcagtc ccaggctcga gttttcctgc gatcacccca cagtcaccca 1860 

cagcagctgt tgctgcttct gtcgggtttt cgtttctgcc ttctttgggt cgtctcttgt 1920 

atacaaaaca caccccagtt ctctaactaa attcaaatac gaccccggca gaatttacac 1980 

atttcgtggt gcatggattg tgtcggtgca ggggaaataa ataccctctg gtatttaacc 2040 

actgagtcta attcgaaaaa tcgggactgg gcccctaggc ggcaccccag gggctccaac 2100 

ctggcccgcg cctccccaga ccttggcgct gagagcgctg cttttgcggg tgggtggacg 2160 

gagaggtaac aatctgcttt caacaaaaac ctgtcgccac cgaatcgaaa gcgaaaggga 2220 

agggagaag 2229 

<210> 12 
<211> 2820 
<212> DNA 

<213> Homo Sapiens 
<400> 12 



cctaggggac aatacccagg catgttaacg gagtttaaaa tgccaaggaa attacaccac 60



aattctgccc agtatactac aggctgtcaa accgaaatgc tatgccagct aggagtgcag 120 

caactcccat cctctggccc tatttaatta ggaagcttca gcagagcgaa gcctgccaag 180 

cgttcgccgt cagaatctga aggaacccga gcgagcaaga agagtgcctg acccactcca 240 

cagaagcctg tccagaaatg gaggagtcag cgcccactga agtcggttcc gccctcggct 300 

cgcctacatg gagcctgacc agcctcagtc atgcccactc cggcctggga gacccgcaaa 360 

gtgttctttt tctcaactcc cctgtactac cttgaagctt agggaagcaa agagaggggc 420 

atatctggac tgcaaaacca atgtcttttg ccgcctagga gagaagggaa tgagagagag 480 

agagagatag atagatagag agagagagag agagagagag agagagagag agagagagag 540 

agaaattcta ttgaaaccca gctcctctag aatctgtgtg acctggtctt caacgggaga 600 

ccagtgcgac ctcatggcac ctttgccagg aatcagcgat tcccctgcag tcaccatttg 660 

atttattgct ttctcgctca ttctttctca taaagttatt tcttcctcat cctagtaaga 720 

cttttttctt taatgatgac aaagcttctg tttcagtgtt tcccctagga ttggtgctct 780 

ttcaaaacag tgaacccaga aaaccatccc gtttaatatt tctcaaaatc ctcgcagctc 840 

caatgtaagc gcaagcatgc aaaggtttcc tgctacacct gcactttctg cccatcccag 900 

aaccacccct cacccccggg cctgcaacag ttccccttgt ttctctggat agaggtgggt 960 

ggtattaggg gtctagggca gtaggaggtg aggggctgag gaggccgcta gggtaggctg 1020 

gtctgtgctg gatacgcgtg ttcttctgcg gagttaaagg gtcggggacg ggggttctgg 1080 

acttaccaga gcaattccag ccggtgggcg tttggcagtc acttaaggag gtagggaaag 1140 

cagcgagctt caccgggcgg gctacgatga gtagcatgac gggcagcagc agcagccagc 1200 

aaaagccctc gcaaagtgtc cagctgctgc actgccgcgg ggactcccac agcaccatga 1260 

ctagttcgtg caactctgca gcagcaaacg gcttccgagg aacacaggat cgcgggggcc 1320 

gggcagcggg ctactgagca tcccgcggac ggcggcagca gaggcggcgg cggtggcagt 1380 

ggcacccggc ggggaagcag cagccaaacc cgcgcatgat ctcgagagtt tcagcaacat 1440 

ccagggactg ggctcagccc cggagcgaga gggtcgtccg ctgagaagct gcgccggaga 1500 

cgcgggaagc tgctgccata aggagggagc tctgggaagc cggaggacag gaggagacgg 1560 

gagtccaggg gcagacgagt ggagcccgag gaggcagggt ggagggagag tcaaggcgcc 1620 

ccgcagcccg gcagccgcct ctcgagctct gccgcccgca tccctctggc gtttgggaag 1680 

cagcaggtcc tcagcccgcc cggggtcacg tgggaagagg cagtcgggct ctgattggtg 1740 

gagcaggatg caggtcccgg gagggagggg tcgacgagga ggtgcaagga tgcaaggagg 1800 

aggcggccgc ggaagccaca gatgggctcg ctcgccaggc gctggcccga gtggggctag 1860 

gcggggatgg ctcaaatgag aagctcgggc ttcagggtgg gctacccgca cactcatata 1920 

ccattcgcct cactctccgc tccaggacgc cccctaccga aggcggggtc cggactagcg 1980 

cccctcttcc gcgcgtgacc ccgggccgcg agtgcgggcc gcggctgggt ggcgtctctc 2040 

cgagctggag atggtggggg cggaggtgtc agaggagcag cagcagcagg gcagagaggg 2100 

gcgagtcggc gcgggagagg gcgtcctgct ggcgaccggc gctccagcgt gcgggagcgc 2160 

gccgcctagg ctgtaggggg atgcaggctg ggaatgtcgc ggcggagagg ccagggacgt 2220 

ttctctaggg atttacagga aagagggtga gaggcgatgg tgttagaacc gctcttgccg 2280 

acctggaagc aacagcagca tctcccacaa gagcgtgcaa ccccaaggct gctcgccgag 2340 

gcagctcagc catcccggca ggcgctctcc ttccttctct cttctcccct ctctcctccc 2400 

aggccccccg cagctccgac ccagcccaag cgttcgcagg tttgaatccc tctcctcatc 2460 

acccgctcct ctccagcccg tagcctatta gtgtgtccac ctgggaggtg cggtcagatg 2520

tgtttggaag gtcagattgg tcgggacaag tggtctgaga gaaagagaaa ggctcctctg 2580 

catacgccgc gggtgggttg ccgggagcat cggccgggca gcggcgtccg ggaaggggag 2640

agcgggctcc atttgttggc ccaggcagtg accctgcgtt ccttactcgg gtctttgccg 2700 

gatggccggt gacctggggc gacgagagaa ggtctaactc ggcaggagtc tctggctctg 2760 

cgcgtttctt tcattctctc cagcgggaag ggcaaacggc atagcgggac ccgccttccg 2820 

<210> 13 

<211> 2280 

<212> DNA 

<213> Homo Sapiens 

<400> 13 

accagcacag aattgctacc ataggatcgc agcctaagca ctagagtgac attaattggt 60 

catgcttaac tgcctcaaaa tcatttttta aataattaca ctgatactat aatagaaatc 120 

atgggtactt attttacatt cagatggaag gcattattgg atatgtatta aaaaaaagac 180 

cccctgaaaa aaataaaata aaataaaaca tcaccatcaa aataaaagaa cccaaaacaa 240 

cccctaaaaa cttccctcaa caaaatacat tgttaactca taaaatggac tgatgactag 300 

ccatgcaaat gtcctaaata aaacctttac atttttttca cagttaactt atgctctgaa 360 

ctgcctaccg atcacaaata atggcgaaat ggcactttct gattatactg tattttgttt 420 

atagaaagtt tgatacgatg gaacttatca ggtaagaggg tgggtgctgt gaacgagatg 480 

ccgtctccag tcgcgggggc gggcagagtc cctggagcgc gtggattcca tgcgagccat 540 

gcagcacttt ttgttttttg tcagaagtca aagttactta tttacaatac attcatgcct 600 

tcgtgcaact gcccatccct gcgtagccaa cagggagcca tcacggggct aatccacagg 660 

ggaaaaatag atatctatct ctctatatag atgtggatat atgtatatat gtatagaacc 720 

gcggcatcca accccacagg ccccggggcc gagggcgagg gcactgtcag ttcttccagc 780 

agggtcgtgc tgggctttct gtcaaaaggg gctctcagca aagagcgagc tggctgcgct 840 

ctcccagctc tccacagtct gctctttgtt tcaaggaggg agctaagtaa ggggtcaggc 900 

cctttcggtt gtgcgagctc acagttattt atctacttat gcccatccag gctgatggcg 960 

cggggatttg ggtacacgcc cctccagccc ccggggtgcc tgcggtgggg aaggatgtat 1020 



cgccttccct ctgccctccc ctattggggt tggggtctta gtctgagagc gagtgagagc 1080 

cacagtcata cactctgtgg gccccatctg cgttgtaagg cccattgtgc cagtaggaag 1140 

agtcacagac tgtctgtagg gaattaatct cggacgcgga ggagttggca tctcgtctct 1200 

tggaccgctt tcggttcctc aggataaaca cgagcatgcc caccacggtg aaggcggagg 1260 

tgacaaacac cagcagcagt cccgggacca acaccgagat ggacaccctg ctggtgtcta 1320

ggtaggagtt ggagtgcgtc ccggtctccg ccaacccagt gctgttttta ctgtgcgaag 1380

ttaacgtggg cgagatccta gcgtacagct gagggcagat ctcgtcattg gagaggagca 1440 

tgaaatcctt tctaaagaag ttcaccggcg tctcacactt gaggtcgctc atcagcactt 1500 

cggaacccaa gcgttctgcc cactgcttga aaggcacaat tgtgcaggag cactcccagg 1560 

ggtttccgtg gaggtctatc tggatgatgg aggttaactg gtccagcacc cctgccaccg 1620 

ggaggtacat gaagtaattg ttgtgcaggc tgagtttaga gagcgagacc ccagcgaaca 1680 

cgtccacagg cagggacctc agcaggttgt tgttgagaat gaggatcctc agtttgggca 1740 

tggcattgaa agtgcccggg aggatgagct ggatagcgtt gtactccacg ttcaggtact 1800 

ctaggttttg cagccccgcg aatttctccc gggacagcgt gtccaggtaa ttgctatcca 1860 

tgtatagcca cctgaggtcc aaaaggttct tgaaagtgtt gttctctaca gtagcgatgt 1920

tattgttgcc cagatccaac agaatgaggt tcttgtaatc cacaaagtgc gattttcgga 1980 

tgctgtggat cttgttatct cgtaggaaaa gctcctgcac gttagagagc ttgggcttca 2040 

aatcagccaa gctgctcacg ttcctgttgt tgcagttcat ctttaaaccc gaccctggga 2100 

tgtggtcgca gctgcagccc ccagggcagg gtaaactgtt agctaagggt ttgttcctgg 2160 

agctacccgt cgctatcgct gctgtgggtc tgattttgat ctgccagttg cctgggatct 2220 

ttgtacctcc gtttggagca gaccctggtg tggcatgatc ctcttgccca tttgtcttga 2280 

<210> 14 
<211> 2470 
<212> DNA 

<213> Homo Sapiens 
<400> 14 



ctttgttttt atgtttagtc tattgaagct tgctctcagg ccactggagc tgagctttct 60 

gaacacttct ggttcagagt gctgcagatt tatgaattgt tgtttgctga aataaggtgt 120 

aaagtttatt ttgtctaaag tttttaacag gagtgataat tcaggcaaaa tgttcacata 180 

atagtgggaa atcaaaagct tgcgatacag aaaagtgata tatatttgta aactccttta 240 

acaaaggcat cgtagcataa gcatgactgt agagccagac tgtcaatgtt caaatttttt 300 

ctgggcctta atttctttat ctgaaaatcg ttaacagtgg aacctagttg atagggttgt 360 

tgtggagcaa gcgttcttaa tctggagtcc atgaaatttc agcatctgtg aactttgtat 420 

tgaaaaaatc gtaactttat gtctattaac ctcacaacag tttcgtttct ttgaattata 480 

aaggcaggca acaaaccaca gtagtattat cagtacctac aactttgttt cctacagaaa 540 

ttattttcat actcagtaag ttgttgcaga tatcttgaac aattatacgt atatattcct 600 

taagtattat atatataata tttttatatt ctgtatatat ttttggagac agggttttgc 660 

tctgtctccc aggctggagt gctgtagcat gatcggaact cactgtaacc tcaaactcct 720

gggctctagt gatccttcca gcctcagtct tggaatagct aggactacaa gtactcccca 780 

ccatacccag ctgttaattt tttttttttt cgtaaagaca aggtctcact atcttgccca 840 

ggctgctctc caactcttgg acacaagcct cctgcatagg tctcccaaag cattgggttt 900 

acaggtgtga gccaccgtgc ctggctaata ctatgtattt aatttcatgc gtttagaaat 960 

atgtgctatt ttaagaaggc gtctacaggt tttatcacat taccaaaggt gttcatggca 1020 

caagtgaaga aatctctaga tactagagta ataaaaacca tgatgcatta gaacggatca 1080 

ggcccatatc aaacgttcaa catacagtct gctgctatgg ttcttttttt tttttttttt 1140 

ttttttttga gacagagtct cattctgtct cccaggctgg agtgcagtgg agtgatctcg 1200 

gctcactgca acctctgcct cccgggttta tgccattctc ctgcctcagc ctcccaagta 1260 

gaggaactgc aaggcgctcc accatgcccg gctacttttt gtatttttag tagaaatggg 1320 

gtttccccat gttacggcag gctggtcttg aactcctgac ctcaggtgag ccacctgcct 1380 

cagcctccca aagtgctggg attacaggca tgagccacca tgtccagtct gctatggttc 1440 

ttaaaatatt catagaatag aaattgaagt aagtctattc gtttttatat taaaagtagc 1500 

tagtacggtg ctctgtactt atgcactcca tggatgctga atgagtcatt gaatgaagtg 1560 

gagtctaggt tgtgcctcag gccttccagt ggggctggaa ggaacagttg tgaagttgtg 1620 

gggcggcttc tgtgcatcgg gggactctgg agttagatgg tttcccaggg ctctgaagga 1680 

gaagagctgt gttcctaata atgatctcaa tgggaataga gattgctggg gaccgcaggg 1740 

gccaggttgt cattccccaa agcttcccct tcatcatcca agaaggcatt caggtctttc 1800 

tgtgctaggc cccaggtaaa gtgctggact acccagtaat tgggttcagt agcaggatgg 1860 

cctcagattg aggtcccagg gccaaaggac cactcctctc ctcagcgctg gtccgggaaa 1920 

ggcaagctcc gggcgggagc gcacgccgcg cccccgaagc ctggctccct cgccacgccc 1980

acttcctgcc cccatcccgc gcctttccag gtcttctccc ggtgaaccgg atgctctgtc 2040

agtctcctcc tctgcgtcct cggccgcggc ccgggtccct cgcaaagccg ctgccatccc 2100 

ggagggccca gccagcgggc tcccggaggc tggccgggca ggcgtggtgc gcggtaggag 2160 

ctgggcgcgc acggctaccg cgcgtggagg agacactgcc ctgccgcgat gggggcccgg 2220 

ggcgctcctt cacgccgtag gcaagcgggg cggcggctgc ggtacctgcc caccgggagc 2280 

tttcccttcc ttctcctgct gctgctgctc tgcatccagc tcgggggagg acagaagaaa 2340 

aaggaggtag aatggatccc cttggccttc ccctgtgggc gggggcgggc cagggtgggc 2400 

cgcgttgcca ggcagccctg ccgtgttgct aggcagcctg gtcgccggcg tgggcgatgc 2460 

cggcgctggg 2470 



<210> 15 

<211> 5907 

<212> DNA 

<213> Homo Sapiens 

<400> 15 



gccgtcggag cagggcccca gcgcaccgcg agcctctgct tcctccgcct tgcagcgccg 60 

cgagcccttc tttcctccgc gcctggcccc gtctgcgccc ctctcggcgt gggcattcgc 120

agacccgcta ggaggctgac gagaggtgtc ctcatagagt cctgccttct tccagacccc 180 

caaacgaaaa gaaagagaaa agccaacctt tcgcctccat tctgcacgca tttggagagc 240 

ggctggtggc gagcgatccg cacaatcgct tccctcgcgg cgtccctgga cgggcggaga 300 

gaacccgggt gactgagcct gaggctgagc ttccttggga gcgatccatg ccagccactt 360

ctaccttcaa gggggactcg cggccgggtt tacgcgcgaa ggatgggtgc cagggaaact 420 

gacttataga cgacccgctc cgcgagagac tgacaactca gtcctgcggg ctttcggggc 480 

gcccagacca gtccttagcc agaggatctg gggctgcaca gctcccggac cctggagggg 540 

gtggtccggg ggtgggagac tgcgagcatc agcccacgcc ggccccctcc caccctgcgg 600 

cggcgaggtg ggaagggcca ggatgcgagc ttagcacctc ccctagaaat gcagcacttc 660 

gcccccccac ctcccctgcc ctcggcgctc cctctttcac gcgctccctc ccgcctcctt 720 

cccaagggcg cccttcttct gcccccagct cacgtctgaa tccctcggcg ccccctttct 780

cttctcctag ccccttcctc acgtcccctg cctccgggta tttcctctct ccaatccccc 840 

acccccgcac cgcctgattc cgaggggcgg gagcgcattg ggctgcgcac gggtgggggc 900

gccgcgccag cttcgcgtag ctgctctgac gccgctgccg ccgccgccgc cgccgccgcc 960 

ctccgcagcc cagctcgcgc cccgcggcag ctccgcagtg cactagccac caccgccgcc 1020 

gccgccgctc cgccagacct gctgccagct tgcccggtcc agccctgaga gagcctcgaa 1080 

cgccagctgc gagggtcatg agccagagag ccccggggcg ccgcgcggag agcaagcgga 1140 

gatagcgact ttgcgccccc cagccctcgc cttcttgcat cgcgttcccc gcatcctcgg 1200

gtccttctgt cctttccgct gtccccaccg ccgccatggc caccttgctc cgcagcaagc 1260 

tgtccaacgt ggccacgtcc gtgtccaaca agtcccaggc caagatgagc ggcatgttcg 1320 

ccaggatggg ttttcaggcg gccacggatg aggaggcggt gggcttcgcg cattgcgacg 1380 

acctcgactt tgagcaccgc cagggcctgc agatggacat cctgaaagcc gagggagagc 1440 

cctgcgggga cgagggcgct gaagcgcccg tcgagggaga catccattat cagcgaggca 1500 

gcggagctcc tctgccgccc tccggctcca aggaccaggt gggaggtggt ggcgaattcg 1560 

ggggccacga caagcccaaa atcacggcgt gggaggcagg ctggaacgtg accaacgcca 1620

tccaggtaag cgcgggattc ccagttctgc ctgtcctccc ccctcccagc tcagcgtgcc 1680 

gggctctgcc cccgacagtc gcccggtgat ctcggcctgg agaccccctc ctgtacccag 1740 

gaatctccct ttctccatcc ctcccagccc tgcgcgggga cctacgcccc caggcggtgt 1800 

tttccgccct aacccacgct ccctcccaac ggcaccagct gcaagaccgc taggctgaag 1860 

ttcggtctga gacacctgtc cggagacact gcaaaagtga aggaaatggg gggagggagc 1920 

aggaagcgat gagaaagaaa gaaaatcagg attggagggc acggtttggt cttggactct 1980 

ggaacggatt cacagctgca tttttgggag gaaagaagaa ggggaaatcg ctgaggtcgg 2040

agtctcctcc ccccgcacac acgcatagac acgcacacgt atataggcag gcacaccaca 2100 

tgcccacagc cccccttccc acgaacaaag gcgcaggcac aggcgcacac gacatgtatc 2160 

tatatataga ccacagtact atccccaaat gcacctgcgt acacccctac agctccacaa 2220 

acaaaagcat atacctgcac ataccaaaca cacgaatatt cgtttgaaca tatgcacaca 2280

cacatgcaca gatgcatgca gacaggaacc agcctgctcc cttgggtgcg gccctgttgg 2340 

agacatacag gtatacacac acgaaagcac acacatgctc ccacacaaac acatttagtc 2400 

cgcaataatc cagcattacc tgtgcacaga ctcgcccctc tcgcccgaaa tcccggctgc 2460 

accgcaaaga ggcggccagc tcctccccgt gtgaccctgg cgaagccctc tccagaggct 2520

aagaatggcc cctgctctct aggccctctc cccttattcc taacccccca ctccccacca 2580 

ggcggcgcag gactcctaca tagggagcag gtttggagtc ttggcaacga cctccgagct 2640 

agcaggggct ggcaggccgc agctttctgg gccgaaggag acttcagctc taggcaaccc 2700 

cggtccctta cagcttggcc ccgcggcgtc cctggagcgc actatcactg gggccacgga 2760 

aggcaggttt tctgggagca gaggcctccc aggggttgtt ccatgtatcg gggtaagcag 2820

gactccaccg ggccccgaac accagatggc ccgacccaag gcgctttccg ctccggggcc 2880

cggcgagacg gccttgctgg atcacggacc cggggctgct cccgccttcg ctgtgccccg 2940 

cgacttcccg gggcgtcttt caaggctccg tttgataggc cccaagggag gccagggcag 3000 

gaggctggag gctcgtaggc ctccgggacc ctggatttcg ttagctctta gtccccgtgt 3060 

ggattctaaa ccctacagcc ctgtccgcct gaccctgaag cttctggttc tagactcttg 3120 

ctcggttcgg ctttctggcc tccctggcgc cttcccgcca agctcctcct cctgcttgcc 3180 

tccccagcct ccacttagtc cccttcctta ggtctcgccc tcgcctcact ctaactccag 3240 

ccttgtccta agactctcaa tttccagccc tgcttcctcc tccccgcgcg ctgctcccca 3300 

agcgtcttca cattgtctct gttcttgcct cccaggggct ttgccccacc ttcagcccct 3360 

ctccctctct accctatttc cttttcgcct cacaactgca gcctttaacg tttcctcctc 3420 

tggtctgagc ctgagacgcc cgagttttcc tctttagctc ccgtgcccca ttccagcccc 3480 

accacaaatc ccgtgcccac tctttccact ggcccaggcc cagctcacta cctttcttag 3540 

agcccctttc cgactacgct cccttctgtc ccttctcatc tcctccccca actcctcatt 3600 

ctgttcctgt ccccagaaac catcccgaag tatctccctt ccctcctgtg gcttctccct 3660 

attctcagcc ctctcccacc ctagccctct cgctatcacc agccattagc gcacctttcc 3720 

ttctcagatc cgtccataca tccctccctc tccatctctg gctcgacacc cgggctcact 3780 



attagttccc ctacatccac cccgcagcct gctcttaacc tctcctcccc ggcggctcag 3840

acccaattct cagtgtcctt agcgccccct tcgggccaca gcgttaagcc acgccccccg 3900

ggcccctcat ccgttgccaa gttcgctgag cgtccgcgtc tggttgcctc tccgccccac 3960

agggcatgtt cgtgctgggc ctaccctacg ccatcctgca cggcggctac ctggggttgt 4020

ttctcatcat cttcgccgcc gttgtgtgct gctacaccgg caagatcctc atcgcgtgcc 4080

tgtacgagga gaatgaagac ggcgaggtgg tgcgcgtgcg ggactcgtac gtggccatag 4140

ccaacgcctg ctgcgccccg cgcttcccaa cgctgggcgg ccgagtggtg aacgtagcgc 4200

agatcatcga gctggtgatg acgtgcatcc tgtacgtggt ggtgagtggc aacctcatgt 4260

acaacagctt cccggggctg cccgtgtcgc agaagtcctg gtccattatc gccacggccg 4320

tgctgctgcc ttgcgccttc cttaagaacc tcaaggccgt gtccaagttc agtctgctgt 4380

gcactctggc ccacttcgtc atcaatatcc tggtcatagc ctactgtcta tcgcgggcgc 4440

gcgactgggc ctgggagaag gtcaagttct acatcgacgt caagaagttc cccatctcca 4500

ttggcatcat cgtgttcagc tacacgtctc agatcttcct gccttcgctg gagggcaata 4560

tgcagcagcc cagcgagttc cactgcatga tgaactggac gcacatcgca gcctgcgtgc 4620

tcaagggcct cttcgcgctc gtcgcctacc tcacctgggc cgacgagacc aaggaggtca 4680

tcacggataa cctgcccggc tccatccgcg ccgtggtcaa catctttctg gtggccaagg 4740

cgctgttgtc ctatcctctg ccattctttg ccgctgtcga ggtgctggag aagtcgctct 4800

tccaggaagg cagccgcgcc tttttcccgg cctgctacag cggcgacggg cgcctgaagt 4860

cctgggggct gacgctgcgc tgcgcgctcg tcgtcttcac gctgctcatg gccatttatg 4920

tgccgcactt cgcgctgctc atgggcctca ccggcagcct cacgggcgcc ggcctctgtt 4980

tcttgctgcc cagcctcttt cacctgcgcc tgctctggcg caagctgctg tggcaccaag 5040

tcttcttcga cgtcgccatc ttcgtcatcg gcggcatctg cagcgtgtcc ggcttcgtgc 5100

actccctcga gggcctcatc gaagcctacc gaaccaacgc ggaggactag ggcgcaaggg 5160

cgagcccccg ccgcgcttct gcgctctctc ccttctcccc tcaccccgcc cccaccagcc 5220

cagtgcgccc tgccgccgcg ttgggaggcc aagctttaaa catctctggt tcctagtttc 5280

tgattattcg gggatggggg ggatgggagg ggacagggat tcacgatcca tcgcgtctgc 5340

gtttctgttg tcctttcttt tccacaacac cctggttttg gggggaggcg gggtgcattt 5400

gcgggcaggg ttctctgtcc ttccaagtgg ggccccgaca ctttggttcc agtcatcgag 5460

ggggttggga agggagggag agggggcgca gctcgcaggc gtggcaactt gaccttgggg 5520

gaatatttca catccatcca gagctcggaa tctacagcgt ccagccattt ccagcaagag 5580

cgcttcccat tccggagacg tttcaaccct gcagcgggaa aggctgactg ggaaatccat 5640

tttgggtggg caatttcctt caacgaagcc ggaaggcgag aagccgcggc ggggccagct 5700

tgcctgccgg ttttcaggaa tctaaactct catcttgtgc aatttatcag gtgtggaact 5760

gttctactgt gcgtgtggtg tgctcgtggt gaataagatg aaatgtatat cagaaaaaaa 5820

tctatctcta atttagagtg cggtacataa ttatatccgc aaataaagaa gagacaaagg 5880

cttgcgcggc ccggtgtcgg gtttgtg 5907

<210> 16
<211> 3049
<212> DNA

<213> Artificial Sequence
<220>

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 16 



tattcggtgt agttggttta gattttgttt tcgttttggt cggtgttgtt ggtaaaggcg 60

taggtgttcg tggagttgta cgttattatg tggaagttcg attcggatag tttgtcgaag 120

gtttgtttta ggaagttgaa tttgaggtaa tagcgcgagg tgtagcgttt cgggggacgg 180

tcggggtttc ggttttcgtt tagggtgttt ttaaatattt ttttggttag cgacgttttt 240

tcgtaaacgg tgatgcgcgt tattcgtcgg aatttggcgt tcgtttgcgc gtttcgttcg 300

atggtgtagg agtcgcggta gtcgatggtg atgtagttcg agcgtcggtt gccgtttagc 360

gattgggacg gcgtgagtag cgggttcgtc gcgttttcgg acggattgcg gttagttagt 420

tttagcgtgg gcgacggcgt ttcggtagag gcgttttttt gttgttcggg tttcgagtag 480

ttaagcggta gtagttcgtt atttagcgag ttttttttgt gtatttcgcg tcgcgagggc 540

ggcggtttcg ggtcgggttg ttggggcgtt tcgaggcggc gtacgagttt tggtagttcg 600

aagtattcgg tttcgcgttg tagtcggttg cgttcgggga agtagtcggg tagtacgagt 660

tgtaagtttc gtaggtaatt taggatgtag cggaagagga agtcgtttcg gtttagaaag 720

aagcggtttt tgttgtttcg ggttagtttt tgcggttgtt gttgcgtgaa tatgcgttag 780

agtagcgagt cgggtatcga tattatcgtg tagcgtcggg ttacgtatat ttggtttttt 840

acgtttagtt ttacgatgtc ggggaagagc ggtggtttcg cggaggacga cgaggagtta 900

ttgtcgtttt cgtcgtcgtt ttcgttgggt aatttacgtg tgttgttcgt tagagttatg 960

atagagaggt ggtcgggtcg ggatagtggt aggaagtcgc gttgtattta ggagttgtaa 1020

tcgttttttt cggagtttcg gaattcggac gttcgtttag ttttgcgttt cgtcgtcgtc 1080

gtcgtcgtta tcgtcgttat cgttatcgtc gttatttttt agagtcgcgc ggagtcgagc 1140

ggtgcgagcg cgtcgttgtg cgtttttttg agttttagtg cgtttcgttc gttttgttcg 1200

cgcgttttag gcgagtgtcg ggtcgcggtt cgcgcgttgt ttaagtagag tcgtttgttt 1260

tggtgtcgtg gcgcgtagga ggcgtgggcg gcggcggagt ttcgtttatc gggcgagggg 1320

aggtggggag cggggattcg atttgtattt taaacgcgga cgttacgcgg ggttgttggt 1380

tttatttcgg ttcgcgagcg ttgggggcgt tcggattttt attcgttttt tatttattag 1440



tttcgtttcg tggtttcgtt taattttttt tttttttcga agaggattcg ttcgagattt 1500

ggcggtttcg gaagatagtt ttttggaggg atagttatcg cgtttttttt tttttcgttt 1560

tagttcgtat ttaaggattt tggaaggagg acgtgaattt tcggaagttt gtcgtttcgg 1620

gagtggaaat gcgtggtcgt tttcggtttt ttcgtttgcg gaattgggcg tttttagtag 1680

tgtagttgcg gtttgttatc gttggttgtc gtgttattga aagtagagag acgcgaaaaa 1740

tttggagaga cgaaagagtc gttttagttc gagcgagtta gattttgttt aagttgtttt 1800

gtagttagtt tggaaaataa ataaataaat ttagaaataa aaagtaaatt ttagtgtttc 1860

gtagtaggat gttttacgat agtttatttt ttggttgtag agataaaatg agatgaagtt 1920

ttaaatttgt ttttaattgt aatttttttt tttcgtaggg ttttcgaggt attttatttc 1980

gaagtgaagg ggtaagattt cggaacgttc ggttttcggg ttttatttag ttttcgttga 2040

atcgcgtttt tagatttata atagttgtag atttaacgta atttttaaag ttatttttgg 2100

cgtttgtatg tttatttttt taaaagttta ggcgtttttt ttttttcgta tgtttcggtt 2160

atttttcggt tttgtttaag ttttcgttag ttttagaaag ggatatttat atatttgaag 2220

gtttttgttt ggattttatt ttttaatttt tgttgtttat acgtttgttt ttttttttcg 2280

gattgtagtt tatagaatcg tttttttttt ttggaagttt tttttggttt tttaggtggg 2340

tggggttttt ggatggtttt ttatatattg tttgcgttgt ttttagtacg ttttgaaatg 2400

ttatatttat gagtgattat ttgattattg atttgggtat gaaaataagt aagagttttt 2460

gaatgttgta aaaagagaag agtgtagagg agggtgttaa gaaatgattg ggttattgaa 2520

tgttttttgt gtatttgtta ttatttgtag ggtattaggg agtgaggaat gtaaaggcga 2580

ataattttta aagcgttttt agtgtagttt aataagtaaa gggttagggg tttgcgtagt 2640

tattatatta gtttgaagta tagattatat attaggataa attaggtaag atttttgttt 2700

ttgataaaat ttattttttt gtgtaattat ttagtgagtt ttatagtttt tttatagata 2760

aatgttattt ttattttgaa atgtttatat ttgtttaata attatcgcgt ttatgttagt 2820

gtgttgaaat gagattaatt tttaaattag ttattttaat tgtaagtata gattttgata 2880

ttgaatgtat tttaatattt tttttttgtt gttgttttgt tttttgggat aaataataga 2940

taatataaaa tttttaagtt ttagtgtagg atttcgaggt ggttgtgtta ttaggcgtta 3000

tattagtgtt agagatttga taaaattagt ttttttttta taaatttaa 3049

<210> 17
<211> 3049
<212> DNA

<213> Artificial Sequence
<220>

<223> chemically treated genomic DNA (Homo sapiens)
<400> 17



ttggatttgt agaaaagagg ttggttttgt taggtttttg atattaatat aacgtttagt 60

agtataatta tttcgaaatt ttgtattggg atttggaagt tttgtgttat ttattattta 120

ttttaaagaa tagaataata ataaaaaaag aatgttgaaa tatatttagt attaaaattt 180

gtatttgtaa ttaagataat tggtttgggg gttgatttta ttttagtata ttgatatgag 240

cgcggtaatt gttgagtaga tataaatatt ttaaaataaa aatgatattt atttgtgagg 300

agattgtgaa gtttattggg tagttatata gaaggataaa ttttattaag aataaagatt 360

ttgtttgatt tattttgatg tgtggtttat attttaaatt gatgtgataa ttgcgtagat 420

ttttgatttt ttgtttattg ggttatattg aaagcgtttt ggagattatt cgtttttgta 480

ttttttattt tttggtgttt tgtaaataat agtaggtata taagaaatat ttagtgattt 540

aattattttt tgatattttt ttttgtattt tttttttttt gtagtattta ggagttttta 600

tttattttta tatttaaatt agtgattaga taattattta taaatatgat attttagaac 660

gtgttgggag tagcgtagat agtgtgtagg gagttattta gaggttttat ttatttggga 720

agttagggaa agtttttagg gggagggggc ggttttgtgg gttgtagttc gagggggaaa 780

agtaaacgtg taggtagtag gagttgagag atgggattta gataagggtt tttagatata 840

taaatatttt tttttgagat tgacgggaat ttggataggg tcgaagagtg atcgaggtat 900

gcgggagaga aggggcgttt ggatttttgg aaggataaat atgtaggcgt taaaggtggt 960

tttaaaaatt acgttagatt tgtaattgtt gtgagtttaa gggcgcgatt taacgaaggt 1020

tgaatgggat tcggagatcg aacgtttcgg agttttattt ttttatttcg aagtgaagtg 1080

tttcgaaggt tttgcgggaa gagggaattg taattaagag tagatttgaa gttttatttt 1140

attttgtttt tgtagttaga aaataaatta tcgtaaagta ttttattgcg ggatattgag 1200

gtttgttttt tgtttttagg tttatttatt tattttttag gttggttgta aggtaatttg 1260

aataaaattt ggttcgttcg gattggggcg gttttttcgt ttttttaggt ttttcgcgtt 1320

tttttatttt tagtgatacg ataattaacg atggtaggtc gtagttgtat tgttggaagc 1380

gtttagtttc gtaggcggga gggtcggagg cggttacgta tttttatttt cggggcgata 1440

ggttttcgaa agtttacgtt ttttttttag agtttttggg tgcgggttgg ggcgagaggg 1500

gagggggcgc ggtgattgtt tttttaagga gttgtttttc gaagtcgtta agtttcgggc 1560

gggttttttt cggaagaggg agaaagttgg gcgggattac ggggcggggt tggtgggtgg 1620

ggagcgggtg ggagttcggg cgtttttagc gttcgcgagt cggggtggag ttagtagttt 1680

cgcgtggcgt tcgcgtttaa gatgtaaatc gagttttcgt tttttatttt ttttcgttcg 1740

gtaagcgagg tttcgtcgtc gtttacgttt tttgcgcgtt acggtattag agtaggcggt 1800

tttatttaag tagcgcgcgg gtcgcgattc ggtattcgtt tggagcgcgc gggtaaggcg 1860

ggcggagcgt attggaattt aagggggcgt atagcggcgc gttcgtatcg ttcggtttcg 1920

cgcggtttta ggaggtggcg gcggtggcgg tggcggcggt ggcggcggcg gcggcggcgg 1980



ggcgtagggt tgagcgagcg ttcgggtttc ggggtttcgg ggaaggcggt tgtagttttt 2040 

gagtgtagcg cggttttttg ttattgtttc ggttcggtta tttttttgtt atggttttgg 2100 

cggatagtat acgtggatta tttaacgggg gcggcggcgg gggcggtagt ggtttttcgt 2160 

cgtttttcgc ggagttatcg ttttttttcg atatcgtgga gttgaacgtg gggggttagg 2220 

tgtacgtgat tcggcgttgt acggtggtgt cggtgttcga ttcgttgttt tggcgtatgt 2280 

ttacgtagta gtagtcgtag gagttggttc gggatagtaa aggtcgtttt tttttggatc 2340 

gggacggttt tttttttcgt tatattttgg attatttgcg ggatttgtag ttcgtgttgt 2400 

tcgattattt tttcgagcgt agtcggttgt agcgcgaggt cgagtatttc gagttgttag 2460 

agttcgtgcg tcgtttcggg gcgttttagt agttcggttc ggggtcgtcg ttttcgcggc 2520 

gcggggtgta taaggagggt tcgttgggtg acgagttgtt gtcgtttggt tattcggagt 2580 

tcgaatagta ggagggcgtt tttgtcgggg cgtcgtcgtt tacgttggag ttggttagtc 2640

gtagttcgtt cgggggcgcg gcgggttcgt tgtttacgtc gttttagtcg ttggacggta 2700

gtcggcgttc gggttatatt attatcggtt atcgcggttt ttatattatc gggcgggacg 2760 

cgtaggcgga cgttaagttt cggcgagtgg cgcgtattat cgtttgcgga aagacgtcgt 2820 

tggttaagga ggtgtttggg gatattttga acgaaagtcg ggatttcgat cgtttttcgg 2880 

agcgttatat ttcgcgttat tattttaagt ttaatttttt ggagtaggtt ttcgataagt 2940 

tgttcgagtc gggtttttat atggtggcgt gtagttttac gggtatttgc gtttttgtta 3000 

gtagtatcga ttagagcgag gataagattt ggattagtta tatcgagta 3049

<210> 18 
<211> 2407 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 18 



taaggtttgg gtatttttag gtagtaggga taaggtgggt tttttttttg gttgttaaat 60 

ttacgttaaa gtcgagttta gggattggag tttaagaaat ttatcgttta ttttttagtt 120 

cgatcgggga tttgtatgta tttttgtcgt gttgttttga gttttttaat tttttatatt 180 

ttttttttgt tatgtatacg tttttattta ggtttgtaaa agttttagtt ttttttaggg 240 

gtagggattc gtacgtcggt ttagggtttg gtacgcgggg atgttgaaat agggttaggt 300 

ttggttttta gtcgatcgtt agagttttaa ggtttagtaa ttttttttat aaaggtttcg 360 

ttaagaggcg aggaaataag agtcgggaga ggggcgcgga acggcgggcg ggacgaacga 420 

ttagtttcgc gttttcggtt agttgcgtcg agttaggggt atcgcggttg ttgtgcggtt 480 

ggaaatttag gaatgggaag gttcggggtt tgttcggttt cggaggtagt tggcgggtcg 540 

tttttggcgg cgttggagcg gttagtggta gtcgggtacg ggcgatcggg tcgttcgggt 600 

cgtttttaga tcgtgatttt cgaaaaattt tgcgggcggg gcgcgttcgc gtcgtttttt 660 

gtcggaaggt gcgagttagt gcgttcgatt gtgggcgggg gcggaaagag gcgcgtttta 720 

aagtggtaat agatggtttt tttatttaat aggattaaaa aatttgtttt tattcggtcg 780 

atcgcggaag tagagtaggc gggcggttaa tggggatatg atggggggcg gagtcgaggt 840 

tttcgaagcg gaagtgggtt gttgttgagg cggcggtatt tttttcgagg agtttttttg 900 

ggcggttgaa gaaggagttt ttttttcgga gtgcgtcggc ggtggcgttt gcggatttaa 960 

ttagttttag gttaggtcga gttttgcggg aaagtagcgg taagttaggg ttttgtagat 1020 

gcgaggttta ggtagtttcg cggtttatag aggtttcggt tcgcgttttt tgggggagtc 1080 

gcgttgcgcg gtttgattta gtcgaggttt tgtagttcgg gatttcgagt tagttttggt 1140 

cgttcgtatt gtcgttcgcg cgggcgtatc gagttcggtt tggcgcgggt aatagaagtt 1200 

aggaggtttg cgtttgggtt tcggtttatt ttggggggtc gcggttatgg ggtttagttt 1260 

ttagtttagg aagggaaatt gagattttgg gaggggtagg aacgttttta aggttatttg 1320 

gaaagtcggg taggatgtgt tgttaggggg aagattcggg tagggttttt gtttttcgtt 1380 

gacgacgttt tttttgtgtg ttcgcgtcgt cgtttcgtta tcgtggggtt tgcgagtttg 1440 

tcggggtgcg tgggtcgcgt ggcggggttt tttgtaggtc gggaggattt gagtacgggt 1500 

gcgggtttga tcgtgggggc gtcgaggtcg tagtttaaaa tttagtaggg tttcgatttt 1560 

cgggcgcgtt ttcgggtttc ggttggtggt tggtggaacg tgcgattgtg aggtttgcgg 1620 

tttagttttg tatcgttcgg gttttttatc gttttggcgc gtttatagat aggtgtatga 1680 

agatttttac gattcgaaat agagttgtta gtaaatatcg ttttttcgtt tttgatttat 1740 

cggggaagag ggaaaaggat agagtttggg taagtcgttt tggtagggat tttagttttt 1800 

gttttttatt tgttagtttt tatagacgtt tataaattta ataattttcg ttttgttttt 1860 

gtattaagtt tttgagttag acgtagggtt ttagttttgg agtttggttt agattgttta 1920 

attgattggg gagattgagg tttagaaaag tgaagtggtt tgtttaaggt tatatagtta 1980 

gttatttggt agtagatgag gttaagtttt atttgtaaga tttgggtttt gaatttattg 2040 

attaggagtt ttgggattat tgttaataaa agagatattg aagggaattt tttgttattt 2100 

ttttggtgat ttgtttttta atggataagg atatattggg tttagtttta tttgtgagtt 2160 

tgaggtgaaa tagaggtatt cgagtagtaa gatatattgt tggtttttgt attgtttgaa 2220 

tttgagtttt taaaaatttt attttaatat atcgtttatt gatttttttt tgaattatta 2280

tttttgtaag gattttttgt aaatattgtt tttttaattt ttatgaaatt ttaatgttat 2340 

acgtaaatta ttttttttta tataatgtat gtatatttgt gttttgtata taaaatgagt 2400 

aagattt 2407 



<210> 19 
<211> 2407 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 19 



aaattttatt tattttatgt ataaagtata gatgtgtata tattatataa aaagaaatag 60

tttacgtatg gtattaagat tttatgaaga ttagaaaaat aatgtttata aaaggttttt 120

ataaaggtag taatttaaga aaagattaat aaacgatgtg ttaaagtaag atttttggaa 180

gtttaaattt aggtaatata aaagttagta atatattttg ttattcgaat gtttttattt 240

tattttaaat ttatagataa aattgaattt aatatgtttt tgtttattaa aaagtaaatt 300

attaagaaag taataaaaga ttttttttaa tgtttttttt attgatagtg gttttaaaat 360

ttttggttaa tgaatttaaa atttaaattt tgtaggtagg atttaatttt atttgttgtt 420

aaatagttgg ttatgtgatt ttgggtagat tattttattt ttttggattt tagttttttt 480

agttagttgg atagtttaag ttaggtttta gagttgagat tttacgtttg gtttaagaat 540

ttggtgtaga aatagaacga agattattaa gtttgtgaac gtttatggga attgataagt 600

gaaagataaa agttgaaatt tttattaaaa cggtttgttt aagttttatt tttttttttt 660

tttttcgatg gattaaaggc ggaaaagcgg tgtttattag taattttgtt tcgggtcgtg 720

agaattttta tatatttgtt tataggcgcg ttagagcggt gaagggttcg agcggtgtag 780

ggttgggtcg taagttttat agtcgtacgt tttattaatt attagtcggg gttcggaagc 840

gcgttcggaa atcgaggttt tattaagttt tagattgcga tttcggcgtt tttacggtta 900

ggttcgtatt cgtatttaga ttttttcgat ttataaaagg tttcgttacg cggtttacgt 960

atttcggtaa attcgtaggt tttacgatgg cggggcggcg gcgcgaatat ataaaggagg 1020

cgtcgttagc ggggaataaa aattttgttc gggttttttt tttaatagta tattttgttc 1080

gattttttaa gtgattttgg gggcgttttt gtttttttta gagttttagt tttttttttt 1140

aggttaggaa ttaagtttta tggtcgcggt tttttagggt gagtcgagat ttagacgtag 1200

attttttaat ttttgttgtt cgcgttaagt cgggttcggt gcgttcgcgc ggacggtagt 1260

gcgagcgatt agagttggtt cgaggtttcg ggttgtaaag tttcggttgg gttaagtcgc 1320

gtagcgcggt ttttttaaga ggcgcgggtc gaggtttttg taggtcgcga agttgtttaa 1380

atttcgtatt tgtaaggttt tgatttatcg ttgttttttc gtaaagttcg gtttaatttg 1440

gagttagtta ggttcgtagg cgttatcgtc ggcgtatttc ggagaagaag tttttttttt 1500

agtcgtttag gagagttttt cgagaaagat gtcgtcgttt taatagtaat ttattttcgt 1560

ttcggaggtt tcggtttcgt tttttattat gtttttattg gtcgttcgtt tattttattt 1620

tcgcggtcgg tcgggtaagg ataaattttt taattttatt ggataagaaa attatttgtt 1680

attattttaa aacgcgtttt ttttcgtttt cgtttataat cgagcgtatt aattcgtatt 1740

tttcggtaag agacggcgcg ggcgcgtttc gttcgtaagg tttttcggga gttacggttt 1800

gagggcgatt cgggcgattc ggtcgttcgt gttcggttgt tattgatcgt tttaacgtcg 1860

ttagggacga ttcgttagtt gttttcggag tcgagtaggt ttcgaatttt tttattttta 1920

gatttttagt cgtataatag tcgcggtgtt tttggttcga cgtagttggt cggaggcgcg 1980

gagttggtcg ttcgtttcgt tcgtcgtttc gcgttttttt ttcggttttt gttttttcgt 2040

tttttaacga ggtttttgtg aggaaggttg ttgggttttg ggattttgac gatcggttgg 2100

aaattaggtt tggttttgtt ttagtatttt cgcgtgttaa gttttgggtc ggcgtgcggg 2160

tttttgtttt tggaggaagt tgggattttt gtaggtttgg gtggagacgt gtatataata 2220

gaggaagagt gtggaggatt ggaggattta gggtagtacg gtagaggtgt atgtaggttt 2280

tcggtcggat tggagaatgg gcggtgggtt ttttgagttt tagtttttga gttcgatttt 2340

gacgtgggtt tagtaattag gaaaaaagtt tattttgttt ttgttgtttg agaatattta 2400

gatttta 2407



<210> 20 

<211> 3952 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 20 



ggtcgtagtc ggagaggaat aggaatcgta gtggggacgt tttggttttc gggtttatag 60

tattcgggac ggacgcgtag tagcgcgggt cgggaattgg gtattagggc gggatgggtg 120

agaggtttta agggataagg tagggagaag cgtagcgggg tgcggggaat cgtacgtttt 180

ttttttgttt ttgtttttta tttcgaggcg gtagggcggg cgggcgcggt ttcgggggtg 240

ggcgggttgg gcggggcgga ggcggggtcg tagtattggt tttatttagt ttttttcgtt 300

cgtagttaga gcgagtcgag tcgcggttag tttcggcggg tagggggggc gttggagcgt 360

agcgtagcgt agttttatta gttcgtaaag cggatcgagt tggaagtcga gcgttgtcgc 420

gggaggcggg cgatgggggt aggtgttatc ggtcgcgtta tggacgggtc gcgtttgttg 480

ttgttgttgt ttttgggggt gagtgttagt cggagggggt tcgttttttt tttcgggatt 540



agaatttcga gaagagtcgg gcgtcgttat taaggaaata gaatagagta ttggggtttt 600 

agatattgag ggtgggtggt gggaaaggat ttttgatgtc gggattacga aggagggttt 660 

agggttttcg gagcgtagag gcgatttttt agggtggaga tgagggtaag atcggagtac 720 

ggatgtcggt ttttaggtat cgtagggggc ggtgggggag ttgggagggg ttttttaaga 780 

gggggtatgg ggtttttcga tgtttaggtt tttcggaaga ggatattcga atgtcgggat 840 

ttcgaaggga tttttttttt agtatttcgg tttttggaga gtcgtgggat agaattaagg 900 

cgagagaagg agggggaatt ggacggggat tagagatgag aggggcgagt tgggttgggg 960 

cggggagttc gggacgacgg gatggaataa tggaggaggt cggagggatt ggaggtagag 1020 

gggattcggt aaggataatg gggatcggtt agataatggg ggggaggggc gtcggaggga 1080 

gaggttgggg agcgggagag aggttgagaa gtgggttagg gagggtagaa gaggggcgac 1140 

ggcgagggcg gtagagagat ggaagtagaa ggaaaggaga tagagcgggg cgcggatttt 1200 

gaagtggtcg gggtttcgtt tgttcgttga gggatagtga gacgagaaga gagagtagag 1260 

gtttggatat ttttagcgtt tgttttcgcg tttttgttcg cggttttttt tagtttcgga 1320 

attggggttg cggatttaag gttgggaggg gagtgtttat ttcgtcgggg cgaattcgtt 1380 

ttcggggttt ttttacggtc ggcgggtttt ttcgacgcgc gtgtttttac ggtttgaagt 1440 

agcgtagcgg atcgcgaggg acggagcggg tgtgttttta ttcgggttgg cggtggttac 1500 

gtttcggggt gtagagcgtc ggggcgtatt aggcggggat tttttttggt cgcgcgtagt 1560 

tttggttacg taggcgggag gcggtaggta gggggcgttg cggggcgagg taagtgtagt 1620 

tcgtcgttgt ttgtttgcga gtttggggtt gcgcgtcgtt ttttatttag ttgcgcgatt 1680 

cgagcgtcgc gtttatcgcg tttcgtgggt tttcgcgttt cgaagggtcg ggattcggtt 1740 

tgagtttttt ttggttttcg gtgtaagtcg ttgggtattt aacgagacgg gggcgttcgg 1800 

acgtttcgga ttttgtagcg cggtatttcg ggttatcgat tcggtaaagg tcggcgggtt 1860 

tttcgggtgg ttcgacgaga ttgttagatg tggtttgttt tatggttatg aatcgaatgt 1920 

tttggaaggt ttggtaacgg aggaggcgtt tttatttttt tcgttagtat tttttattag 1980 

agtattgcgt tttaaatttc ggttttatag atacggagat tgaggttttg agagtttaaa 2040 

cgatttggtt aagattttat agtaagttgg gttagagtcg tggttagatt tttcgatttg 2100 

atttttagtt cggtattttt ttattttttt agggggtagt cgttgtttta gttcggggtt 2160 

acggttgcgt tttttagtag ggcggggggg gttgatattg tttttagagt tcgtcggcgg 2220 

tagagtttgc gggtgaggtg ggtagagatt ttgttcgggt gcgggtttta gttcgaaagt 2280 

ttagttttag ttttttggag ttgttaggat ttgtggttgc gttcggggat ttgggagcga 2340 

tcgcgggtac gtttgttgat taacgtcgtt ggttagagac gttgtttaga cgcgggtggg 2400 

cgagcgtaga ttaggagagc ggggaggagg gatcggattt tagaggggta acgatttggg 2460 

tttcgggttt gggtaagagc gagggttttc gcgcgaagat ttaaggaagg aggggagttg 2520 

ggcgtggagg atttgggcgt ttgggaaggt gtatttcgat aggaatttgt cgtttgtagt 2580

ttttgcggtt ttcgttttcg tttcggcgcg cggttgttcg gtttgggatg gttgtgcgcg 2640 

gttcgagttt ttttttgggt cgcggtgaga gagagagggt tgaaattaga gcgcgtttcg 2700 

gcgggtcggt tggcgggtcg cgtcggtaat ggaggtattt tgttatttag acgtttgtaa 2760 

ttagagtcgt cgggttggtt aatgcgttta atagggatgg aacgagggta gtaaatgggc 2820 

gtgcgtgagc ggtcgggttg agttgggtgg atggtggatg gggaggtagt ttcgcggggg 2880 

acgggttttt atcgtattgt tttagtttag tttggtttaa gcgtttcgtt ttttttttgg 2940 

gggatcgcgt ggggaggcgg gaataatcga tgtgttgagt ttttattaag tgttttgtat 3000 

ttttttggta gttttttttg aattatttaa tttttataag gttttagtaa ggtgaattta 3060 

gttattttta tttcgtagtt gaggaaatag aattttttag atttttaggt tcgagattag 3120 

gtttttttag tgagttagtg gtggcgtttt tgttggaatt taggatgttt ggtttgggat 3180 

gttttttttt ttatatatgt taggtttttt ttcgtttttt attttttaga ttatggaagg 3240 

agtcgtggtt ttgagagatt aaatcgtgtt ttatgttttt ggttttatat atttagattt 3300 

gagttttttt tttatacgta gaggttttgt ttttttttaa gtttttatgt tatattattt 3360

tgggtaatta gttttttttt ttttaatttt ttttttattt aaagggagtg aggggtatgt 3420 

gttagtgttt tttttgggtt tgtatcgatt ttaaaaggtt ttttaggtgt ttgaaaattt 3480 

tttttatttt atggcggtga ttacggtttt atttatttaa gagttttttg ttgaagttgt 3540 

tggaatggtt agggttgatt atattgaaat attcgtatgg ttttaaatat ttttaattgt 3600 

ttcgttgttt tttttttttt tttaattaaa aaaaatgaaa atttttaaaa gttaatttat 3660

ttaaggattt ttttatttta gttttttgta agttttgaaa tggttaaaat aaaaagatta 3720 

taagcgtttt taaaaatgag atgaaattat tttgttaaat gtgaaatgat taagtgtaat 3780 

atatataaat tttagggaaa tttatttgtt atttttataa attaattttt ttggatttga 3840

ttttggagta attagtttga atgttagtta ttttttttta tttggttaaa tttttttttt 3900 

ttgtttgggg tttttttaag atgagatgtt tttttgataa ttaatattta tt 3952 

<210> 21 
<211> 3952 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 21 

aataaatgtt agttattaaa gaagtatttt attttaagga gattttaggt aggaggggag 60

agtttagtta agtagggagg ggtgattaat atttaggtta attgttttaa aattagattt 120

aaaaggatta atttataggg atgatagata aatttttttg aaatttgtat gtgttatatt 180



tagttatttt atatttgata gaatgatttt attttatttt taagaacgtt tgtaattttt 240 

ttgtttcggt tattttaagg tttataagga attaaggtgg aaggattttt agatgaattg 300 

atttttaaga atttttattt tttttaatta gagaaaagag agaaatagcg aaatagttga 360 

aagtatttga aattatacga atgttttagt ataattagtt ttggttattt taatagtttt 420 

agtaaagagt ttttaagtga atggggtcgt ggttatcgtt atgggatggg agagattttt 480 

aaatatttgg agggtttttt gaggtcgatg taaatttagg ggaggtattg gtatatattt 540 

tttatttttt ttggataggg gaggagttgg gaggaaagga gttaattatt taagataatg 600 

tagtatagag atttgagagg gggtagggtt tttgcgtgtg gagaaggggt ttagatttaa 660 

atgtgtgggg ttaaaggtat agggtacgat ttaatttttt agagttacga ttttttttat 720 

gatttagaag gtagggggcg ggagggagtt tggtatatgt aggaaagaga atattttagg 780 

ttaagtattt tgggttttaa taaagacgtt attattgatt tattggggga gtttggtttc 840 

gggtttgaaa gtttagaagg ttttgttttt ttaattgcga agtgggaata attaaattta 900 

ttttgttagg attttataag gattaaataa tttaagagaa attattaggg aagtgtaagg 960 

tatttagtag gaatttagta tatcgattat tttcgttttt ttacgcggtt ttttagggaa 1020 

gaagcgaggc gtttgagtta gattgggttg gaatagtgcg gtaggggttc gtttttcgcg 1080 

gagttgtttt tttatttatt atttatttag tttagttcgg tcgtttacgt acgtttattt 1140 

gttgttttcg ttttattttt attaggcgta ttagttagtt cggcggtttt ggttatagac 1200 

gtttgaatga taaagtgttt ttattatcgg cgcggttcgt tagtcgattc gtcgggacgc 1260 

gttttggttt tagttttttt ttttttatcg cggtttagga agaaattcgg atcgcgtata 1320 

gttattttag atcgagtagt cgcgcgtcga ggcggaggcg ggagtcgtag gggttgtaga 1380 

cggtaggttt ttgtcggggt atattttttt aagcgtttag gttttttacg tttagttttt 1440 

tttttttttt gggttttcgc gcggggattt tcgtttttgt ttagattcgg agtttaagtc 1500 

gttgtttttt tgggattcgg tttttttttt tcgttttttt ggtttacgtt cgtttattcg 1560 

cgtttgagta gcgtttttaa ttagcggcgt tagttagtag acgtgttcgc ggtcgttttt 1620

aaattttcgg acgtagttat aggttttgat agttttaggg aattggggtt gagttttcgg 1680 

gttggggttc gtattcggat agaatttttg tttattttat tcgtaggttt tatcgtcgac 1740 

ggattttggg gatagtgtta atttttttcg ttttgttggg aaacgtagtc gtgatttcga 1800 

gttgggatag cggttgtttt ttgaaaaggt gggggagtat cgagttggga attaggtcgg 1860 

ggagtttagt tacgattttg gtttaatttg ttgtgagatt ttggttaagt cgtttaaatt 1920 

tttagagttt tagttttcgt atttgtaaag tcggaatttg gggcgtagtg ttttgatgaa 1980 

agatgttggc ggggagagtg aagacgtttt tttcgttgtt agatttttta gggtattcgg 2040 

tttatggtta taaagtaggt tatatttgat aatttcgtcg gattattcgg aggattcgtc 2100 

gatttttgtc gagtcggtgg ttcgggatat cgcgttatag aattcgaggc gttcgggcgt 2160 

tttcgtttcg ttaggtgttt agcggtttgt atcgagagtt aggagaggtt tagatcggat 2220 

ttcgattttt cgaggcgcgg gagtttacgg agcgcggtgg gcgcggcgtt cgggtcgcgt 2280 

agttaggtgg ggagcggcgc gtagttttag attcgtaggt aggtagcggc ggattgtatt 2340 

tgtttcgttt cgtagcgttt tttgtttgtc gtttttcgtt tgcgtagtta gagttgcgcg 2400 

cggttaggaa gggttttcgt ttagtgcgtt tcggcgtttt gtatttcgag acgtagttat 2460 

cgttagttcg ggtaggggta tattcgtttc gtttttcgcg attcgttgcg ttgttttaag 2520 

tcgtgagaat acgcgcgtcg gaggagttcg tcggtcgtgg gggaatttcg ggagcgggtt 2580 

cgtttcggcg aagtgggtat ttttttttta gttttagatt cgtagtttta atttcgggat 2640 

tgggagaggt cgcgagtagg agcgcgggga taggcgttgg aaatgtttaa gtttttgttt 2700 

ttttttttcg ttttattgtt ttttagcggg taggcgggat ttcgattatt ttagggttcg 2760 

cgtttcgttt tgtttttttt ttttttgttt ttattttttt gtcgttttcg tcgtcgtttt 2820 

ttttttgttt tttttaattt attttttaat ttttttttcg ttttttaatt ttttttttcg 2880 

acgttttttt tttttattgt ttggtcggtt tttattgttt ttgtcgggtt ttttttgttt 2940 

ttagtttttt cgattttttt tattgtttta tttcgtcgtt tcgggttttt cgttttagtt 3000 

tagttcgttt tttttatttt tagttttcgt ttagtttttt tttttttttt cgttttggtt 3060 

ttgttttacg attttttaga gatcgagatg ttgaggggaa agttttttcg ggatttcggt 3120 

attcgagtgt tttttttcga agaatttggg tatcggagag ttttatgttt ttttttgaaa 3180 

gatttttttt agttttttta tcgttttttg cggtatttga ggatcggtat tcgtgtttcg 3240 

gttttgtttt tatttttatt ttggagagtc gtttttgcgt ttcgggaatt ttagattttt 3300 

tttcgtggtt tcggtattag aggttttttt ttattattta tttttagtat ttgggatttt 3360 

aatgttttgt tttgtttttt tggtggcggc gttcggtttt tttcggagtt ttgatttcgg 3420 

gaaagggagc gggttttttt cggttaatat ttatttttag aagtagtaat agtagtaggc 3480 

gcggttcgtt tatggcgcgg tcggtggtat ttgtttttat cgttcgtttt tcgcggtagc 3540 

gttcgatttt tagttcggtt cgttttgcgg attgatgggg ttgcgttgcg ttgcgtttta 3600 

gcgttttttt tgttcgtcgg agttggtcgc ggttcggttc gttttggttg cgggcgggag 3660 

aggttgggtg aagttagtgt tgcggtttcg ttttcgtttc gtttagttcg tttattttcg 3720 

gaatcgcgtt cgttcgtttt gtcgtttcgg ggtgggaagt agaggtaaag ggagggcgtg 3780 

cggtttttcg tatttcgttg cgtttttttt tgttttgttt tttagagttt tttatttatt 3840 

tcgttttggt atttagtttt cggttcgcgt tattgcgcgt tcgtttcgga tgttgtgggt 3900 

tcggggatta gggcgttttt attgcggttt ttgttttttt tcggttgcgg tt 3952 

<210> 22 
<211> 2436 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 22 

agttttggat gataaggatt aattgtttag ggcgttggtc gagaggaata ggaagagcgt 60 

ttcgtggagt agggcgggcg agaagagtcg ggcggtcgcg gggttttcgc gtatgttggg 120 

taggaagagg aagagttgta gtacgaaggt tattagggag atgtataaga agtaggcggg 180 

ggttattacg ttgagtagtc gtagggttag tattttcgat tatatttttg cggggttcgg 240 

ggtgagaaga ggattgattg attacggcgt ttagggggcg ttttcgttcg ttttttcgtg 300 

tcgcgcgttt ggggtattgt acgcgatttt acgtttttaa ttttttgttt cgcggatttt 360 

ttttttttag ttttttaatt tcggattagg taggttagag attcgagttc gtagcggttt 420 

cgggtttagg ttttatttgg gggtgagtgt tgtttggggt tttgagcgag ttcggcgttt 480 

tggcgcggta gagtttggta taggagggag ggattcgttt gtagttttta ggagttgtga 540 

gggtgggggg tatcggggga taggattttt tttttttgtg gatgtggaaa taagtttagg 600 

tttgggattt ttggtttaga tttttttagg aggagaaagg gaaggatttg gggtcggggg 660 

taggagaatt agcggtattt gatttgttgg aaattttgtt ttatattttt tttttttcgt 720 

ttttgcgttt ttatttatat atatattttg ggtagtattt agggatttta ttaggttatt 780 

taaggttttt tggaggtatg tgtgcgtgag ggattacgat gtgtgtgtgt gtgtttttgt 840 

ttgtgatttt gtacgtgtaa tttgtgtgcg tttgagtatt tgtatttgtg tatgtatata 900 

tttgcgtgta tagatacgta gtttttgtta tttagtttta aaaaaattaa ttgaaataat 960 

cgatatattt atattcgttt tcgggtatcg ggtgaggtgg gggcgtaatt agaatatttg 1020 

tgttagcggg ttttggttag tatatttttt taacgttggt tgttcgggtt atttttttgc 1080 

gaagagcgtt tttatttgag gcgaaattaa aatttttttt tcgcgttcgt ttttcgtttt 1140 

tttgttgaga gaaatttaaa taatttttat ggcgtcgaaa ttttttttat gtcgataagt 1200 

tcgttttggg cgtacgagtg gatgtttggt tagttttttt tgggatcgat ttcgtcgtcg 1260 

tgtttagttt ttattacgtt tttattttat tttacgtttt atagtcgggg tttttggtta 1320 

gtttcggaag ttatcgagaa ataggatttc gtgcgttcga gagaattttt ttaggggtta 1380 

aggaatcggt tagtcggagg cgcgagaaaa gttttgggaa ggcggttgta tttaggatgg 1440 

gtggatgtta cggggttttc gtttaggttg tttcgttcgt acgggtcgat tggttatttt 1500

ggaatttttt ttgtaggttt tagcgttttt cgggaattcg tagttttcgc ggagagcgtg 1560 

ggtttttttt tatcgttggg gcgtagcgta gtgtacgttt gagggtggtc gtcgggggtt 1620 

gggtacgttt ttagttttgc gtcgtcgggg gttgcggcgg tgttgtttat tttagagagt 1680 

tttcggtttg gggtttcggc gaagtaagtg ttttttcggc gtcggtcgtt aggggggcgc 1740 

gggagtagtt agatgcgtcg tagcgttggg aaggcggcga aggatagggg ttaggggagt 1800 

gaggggcgtt cggtaggtag ttttagtttt ggttttgcgc gggagaaggg atagtagaga 1860 

tcgttcgtgg ggtttcgggg tgtagggagt tgttcgttta gttttggcgt tcgtttcgtt 1920 

cgcggtagag ggcggtatag tcggagtttt ggaaagatcg gtagcgtcgg tagtcgcggg 1980 

tttttcggtt attgtttttc ggacgtacgg gaagtcgttt ttcgcgtcgt cgtcgtcgta 2040 

ttgtcgtcgt cgtagagggg tgaggaaatt aatttatcga gttttggtcg tcgataagag 2100 

gagtttcgga cgtcggtttt cgttttgttc gaggttgtaa agttgtggat tcggttcggt 2160 

tggttcgtag tttgcgtttc gtttcgggaa ttgggtaagt agcggggatg tggggaggag 2220 

ggagcgggta gttgttggtt ttttatttgg gcgttagcga gtaagggtta ggaagcggcg 2280 

gggatggtag ttgggtattt tttagtttcg ttattttttt tttcgttttt gggggcggtg 2340 

ttttggtcgt tagttttagt atcgtggatt tgtttttttt ttttttttcg attttttttg 2400 

agcgggttgg ggcgttttgt ttggttttag cgttcg 2436 

<210> 23 
<211> 2436 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 23 

cgggcgttga aattaggtag ggcgttttag ttcgtttaag gggaatcgag aggggaggag 60 

ggggtaagtt tacgatgttg agattggcgg ttagagtatc gtttttagga gcggagggag 120 

gggtggcggg gttggggggt gtttagttgt tattttcgtc gttttttggt ttttattcgt 180 

tggcgtttaa gtgggaagtt agtagttgtt cgtttttttt tttttatatt ttcgttattt 240 

gtttagtttt cgaagcgaag cgtaggttgc gagttagtcg ggtcgagttt ataattttgt 300 

agtttcgggt agggcgagag tcggcgttcg gggttttttt tgtcggcgat tagagttcgg 360 

tgagttgatt tttttatttt tttgcgacgg cggtagtgcg gcggcggcgg cgcggggggc 420 

ggtttttcgt gcgttcggga ggtagtggtc gagaagttcg cggttgtcgg cgttgtcggt 480 

ttttttaagg tttcggttgt gtcgtttttt gtcgcgggcg aggcgggcgt tagggttgaa 540 

cggatagttt tttatatttc ggggttttac gggcggtttt tgttgttttt tttttcgcgt 600 

agggttaggg ttgaggttgt ttgtcgagcg ttttttattt ttttagtttt tgtttttcgt 660 

cgttttttta gcgttgcggc gtatttggtt gttttcgcgt ttttttggcg atcggcgtcg 720 

ggaaggtatt tgtttcgtcg gagttttagg tcgagggttt tttggggtgg gtagtatcgt 780 

cgtagttttc ggcggcgtag gattgggggc gtgtttagtt ttcggcgatt atttttaggc 840 

gtgtattgcg ttgcgtttta gcggtaggga gaggtttacg tttttcgcgg aggttgcggg 900 

ttttcggggg gcgttgggat ttgtaaaggg gattttaagg tgattagtcg attcgtgcgg 960 



acgggatagt ttggacggag gtttcgtggt atttatttat tttaggtgta atcgtttttt 1020

tagaattttt ttcgcgtttt cggttggtcg attttttagt ttttggaaaa gttttttcgg 1080

gcgtacggaa ttttattttt cggtggtttt cgaagttggt taggaatttc ggttgtgagg 1140

cgtggggtgg gatggggacg tggtgagggt tgagtacggc ggcggggtcg attttaggaa 1200

aggttggtta agtatttatt cgtgcgttta gggcgggttt gtcggtatgg aaaggatttc 1260

ggcgttatga gggttgtttg ggtttttttt agtaggaggg cgggaggcgg gcgcgaaggg 1320

gggattttaa tttcgtttta aatgaaaacg tttttcgtaa agaaatagtt cgggtagtta 1380

gcgttggaga gatgtgttgg ttaggattcg ttgatatagg tattttggtt gcgtttttat 1440

tttattcggt gttcgagagc gagtgtgagt gtatcgattg ttttaattga tttttttaaa 1500

gttagatagt agaggttgcg tgtttgtata cgtagatata tatatatata aatatagata 1560

tttaagcgta tatagattat acgtatagag ttatagataa ggatatatat atatatatcg 1620

tggtttttta cgtatatata tttttaaaga gttttgggta atttggtgag gtttttgagt 1680

gttgtttaag atgtgtgtgt gggtggggac gtagggacgg agaaagggga gtgtgaagta 1740

ggatttttag taggttaagt gtcgttgatt tttttgtttt cgattttaaa tttttttttt 1800

ttttttttta gggaggtttg aattagaaat tttaaatttg ggtttgtttt tatatttata 1860

gaggggagga gttttatttt tcggtgtttt ttatttttat aatttttaaa gattgtaaac 1920

gagttttttt tttttgtgtt agattttgtc gcgttaaggc gtcgagttcg tttaaaattt 1980

taggtaatat ttatttttag gtggggttta gattcgaggt cgttgcgggt tcgggttttt 2040

ggtttgtttg attcggggtt agggaattgg ggaggggaaa ttcgcgggat agggggttgg 2100

gggcgtggag tcgcgtgtag tgttttagac gcgcggtacg gaggagcggg cgagggcgtt 2160

ttttggacgt cgtagttagt tagttttttt tttatttcgg gtttcgtagg aatgtaatcg 2220

aggatgttgg ttttgcggtt gtttaacgtg gtggttttcg tttatttttt gtgtattttt 2280

ttggtgattt tcgtgttgta gttttttttt tttttgttta gtatgcgcga ggatttcgcg 2340

gtcgttcggt ttttttcgtt cgttttgttt tacggggcgt tttttttatt tttttcggtt 2400

aacgttttgg gtaattagtt tttgttattt agaatt 2436

<210> 24
<211> 2470
<212> DNA

<213> Artificial Sequence
<220>

<223> chemically treated genomic DNA (Homo sapiens)
<400> 24

agatttattt aaatttaaga atgagaatat aaatttatat tttgaagtgt tttatagaaa 60

ggtttatttt aatgtttgga gtatatattt taatgaatat ttattttatt ttattttttt 120

ttatttttga attaagtaat tttgaattta aagttgttat gattagtatt gaaaagatta 180

ttggattatt aattgtgtga ttttgggata gtaatttttt gtattttagt ttgtttatat 240

gttatatatg aaggttgaag tttgattttg ttttgtgatt attattttaa atatttgatg 300

aaattaaatt ttagtgtttg gaatggtagt ataataaatt tattaagaat aaataattta 360

ttgtaaaaat atattgattt ttaaatgatg taattgatag ttatattatt gtagagggtt 420

gataaataat aaaagaaatg aaagatgtat atggtgagaa ttgaaattat tttgataagt 480

tttttatttg tttattattt aaaattaatg attatgttga atgtttataa attataaaat 540

ataaaagaaa ttttataaat gcgtatgtat aggagtttaa gttattaaaa gttttaaagt 600

ataagtttaa attaaattaa ttaaagaagt tgagaggaaa aattggtttt tatttttaat 660

tattattgtt ttgaggtttt atgtttaata taatttttta agtagaggtt ttagagagaa 720

gagttgtgag gatattttta tatttgtgta gaaggaaaag tttgttattt attttagtat 780

ttttagtgtt atattgatgt gtattttgga tttattttgt ttttattgta taaatttata 840

tttgatttta aagaaaagga aaatttaaag tttttttttt ttaaggggat agaaattttt 900

tgtgttaatt gtttgatttt tttttttgta aggttttatt ggaaattttt tgtaatataa 960

tgtaggggat ttttttatgt gttgatgttg tttatatagt ggggtgggtt tgattgaaga 1020

aaaaaaatcg tatatacgta tgaaagatta tggttttatt ttcggaaagt atgaaaggtg 1080

attgatattt ttaagaagtt tttgttattt aggaaaatta ttaaatattt tatttagaga 1140

tatttggaaa gattgaagga aaggaagaac gaagaaagta gaatttagat ttatgtgggg 1200

agagatttgt ggtagaggaa aagtattttt tttgaattcg ataagggatt tgtttggggg 1260

aattttttgt ttagtttttt attattaggg ttttttgaag tcgggttttt tattgggtag 1320

ttttttggga gtgtagtggg gaatttttat attttttttt taggttttcg aaggatttcg 1380

tttttttagt gtttttttta ggttggtagg agttttgagt ttgatatttt tttttgatgg 1440

gataggtaag ttttgtgggc gcgtaaatac gttgtaatta agttttttgt tgattttata 1500

gttttgtgtg ttttcgagaa gaagtgatcg tatttaattg tttattgttg gtttgttttt 1560

taagagtttg ggggtttttt ttttttaatt tagaattagt tgtacggggg gcggggaaat 1620

gggggtgggg aaggagtggg agggtagtgg ttttcgcgag tagagcgatg ttattgagtg 1680

agtttttgaa tggggagcgt tgttgttttt aagtcgattg gtattttttg ttaggaagaa 1740

acgttaagag gtgggagtgt ttggggaggg aggtaggcgg tttttatcgt aggcgcgggg 1800

agttgttttt tcgttttttc gtttgttttt taagtttgga tttttaggag tggttgaagt 1860

tgcggagcgt ttttggagtt tgtgaatgaa tttttttttt tttttttttt ttttttttcg 1920

ttgagttttt ttttcggttt tgacggtata gtgatataat gatgatgggt gttataattc 1980

gtatttgaat ttgtaggcga gttgtttcga gtttttttgg ggaagaattt taggcgtgcg 2040

gacgtaatag tcgagaatat taggtgttgt ggataggagt tgggattaag attttcggtt 2100

agtttcgtat tttttcgtat tttttagtat cgtttcgtat ttttcgtatt ttttttcggg 2160

ttattacgtt ttttatgcga ttcgtttggg taacgtcgaa tttagtcgcg tagcgttgta 2220

gtgaattttt tttttaaatt gtaataagtc gttttttaag gtaattacgt tttttttgtt 2280

ttttttttaa aaaataaaaa taaaaaattt atagaaaaaa attcgcgagt ttagaaaaaa 2340

gaagtaattg gtagaaggtt ttaattaagg taaagagttg taaggcgaag ttaagaaaat 2400

gtaggtattt aaaaaatgta ggtaattttt ataagggttt ttggggagag gtatatagag 2460

ggattttggt 2470



<210> 25 
<211> 2470 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 25 

attaaggttt 
aagtgtttat 
taattgtttt 
ttaagggggg 
gaaaatttat 
agcgtggtgg 
atgcggggtt 
ttgttgcgtt 
gtttaaatgc 
ggagatttag 
gcgtttcgta 
aaaggtagtt 
tttttggcgt 
tttagggatt 
ttttattttt 
taggttttta 
tatataaaat 
tttgtttgtt 
attgagaaaa 
ttttagggaa 
ataggaaatt 
ataaattttt 
tttttaagta 
aagtattaat 
cgattttttt 
gttttttgta 
agttgatata 
tgaagttaag 
aatattaggg 
tttataattt 
aatagtagtg 
ttaaatttat 
ttttttttat 
taggtagaag 
gttatttatt 
gtttttgtag 
aatttgattt 
tatgtataat 
aatagtttag 
ttaggaatgg 
aagatagatt 
gagtaaattt 



ttttgtatgt 
atttttttaa 
ttttttttaa 
aataaaagaa 
tgtagcgttg 
ttcggggaag 
ggtcgaagat 
cgtacgtttg 
gggttgtgat 
cgagaagaag 
gttttagtta 
tttcgcgttt 
ttttttttga 
tatttagtaa 
attttttcgt 
gggggtaggt 

tgtaaaatta 
ttattaaagg 
cgagattttt 
ttgtttaatg 
tttttaggta 
ttttatataa 
tttttgagta 
tattttttat 
tttttagtta 
ttgtgttata 
aaggattttt 
tatgagttta 
atattagaat 
ttttttttga 
attaaagatg 
gttttaaaat 
attttgtaat 
atttgttagg 
agttttttgt 
tgaattattt 
tattagatgt 
atgtaaataa 
tggttttttt 
agagaaataa 
tttttgtgaa 



ttttttttaa 
tttcgtttta 
gttcgcgggt 
acgtgattat 
cgcgattggg 
gatgcggagg 
tttggtttta 
gagttttttt 
atttattatt 
gaggagggag 
tttttaagag 
gcggtaggga 
taagaagtat 
tatcgttttg 
ttttcgtgta 
tagtaataga 
gtaaagaatt 
gaagtgttag 
cggggattta 
gggagttcgg 
aattttttgt 
gtttagattt 
gaatatttga 
gtttttcgga 
ggtttatttt 
aaagattttt 
gtttttttag 
tataatagga 
ggatggtaaa 
agtttttatt 
aaagttaatt 
ttttagtaat 
ttgtaggtat 
ataattttag 
agtaatataa 
atttttagta 
ttagaatgat 
attaaggtgt 
agtgttaatt 
aataaaatga 
atattttaag 



aagtttttat 
tagttttttg 
ttttttttat 
tttggaaggc 
ttcggcgttg 
gtgcgggacg 
gtttttgttt 
ttagaaaggt 
attatattat 
aggaggaggg 
tttaggtttg 
tcgtttgttt 
taatcggttt 
ttcgcggaaa 
gttagttttg 
taattgagta 
tggttatagc 
gtttaaggtt 
gagggaaagt 
ttttaaaaga 
cggatttaaa 
tgtttttttc 
taattttttt 
aataagatta 
attgtgtaaa 
aataggattt 
aaaagaggga 
ataaaataaa 
tttttttttt 
tagaaaatta 
ttttttttta 
ttagattttt 
ttagtatggt 
tttttattat 
ttgttagtta 
aatttattgt 
agttatagag 
agaaagttat 
atagtaattt 
atgtttattg 
atgtggattt 



gaaagttatt 
ttttaattaa 
aagttttttg 
ggtttattgt 
tttaggcggg 
gtgttggaag 
ataatattta 
tcggggtagt 
tgtatcgtta 
tttatttata 
gaaagtaggc 
ttttttttag 
ggggatagta 
ttattgtttt 
ggttagggga 
cgattatttt 
gtgtttacgc 
tttgttaatt 
gtaagaattt 
ttttggtaat 
gagaatattt 
gttttttttt 
gagtaatagg 
taatttttta 
tagtattaat 
tatagagaaa 
ttttggattt 
tttaaggtgt 
tatataaata 
tattaaatat 
atttttttga 
gtatatgcgt 
tattgatttt 
gtgtattttt 
tattatttgg 
attattattt 
tagaattaga 
tgttttaaag 
tagatttaag 
aaatatatat 
gtatttttat 



tgtatttttt 
agttttttat 
tttttgtttt 
agtttggggg 
ttatatagga 
atgcgggagg 
atgttttcgg 
tcgtttgtaa 
gagtcgagga 
ggttttaaaa 
ggaggggcgg 
gtatttttat 
gcgtttttta 
tttatttttt 
aaggagtttt
ttttcgggag 
gtttatagag 
tgaaagagat 
tttattgtat 
aaaaggttgg 
tttttttgtt 
ttttttagtt 
gattttttgg 
tgcgtatatg 
atatggaaga 
agggttaaat
tttttttttt 
atattagtat 
tgaaagtatt 
aggattttaa 
ttagtttggt 
atttataaga 
aagtgataaa
tatttttttt 
aaattaatgt 
taaatattga 
ttttaatttt 
ttatataatt 
attgtttgat 
tttagatatt 
ttttaaattt 



<210> 26 
<211> 2470 
<212> DNA 

<213> Artificial Sequence
<220> 

<223> chemically treated genomic DNA (Homo sapiens)



<400> 26 



aaaattgaat atttgagaat attggattta tatttttgag ggtaataatt tgttagagtt 60 

gagtagttgt tttttttagt tatggtatgg atattttaat ttgttatttt ttttattatt 120 

ttttattgtt ttatataagg ttagttttat ttatttgttt tatttgtttg gtttttgtag 180 

ttatttgaat ttgtaaatat tatttagatt ttttttgttt tttattaaag tttattttgt 240 

tttgttttaa gttttttatt agtgttttaa agtaagagtt aggtaatttg attaaatata 300 

ggttttttaa tatgtgtttt ttggagatag tattttaata gaaagagatg attttgtagg 360 

tagaaattat aggtattatt ggattataat taggatggta attgtttgtt ttattttttt 420 

tgttattttg acgtagttat attttattgt tttaaagtag attaaaggaa ttggttttat 480 

ggtagaataa tggacgttgg ttttatgtat gaaatttttt taagttgtat ttttatagat 540 

tattttagtt ttaaaagatt taaagttatt acgggttttt ttgatttttt tattttttcg 600 

gaggttttag gatattcggt ttgacggtat ttttttttat ttgttttcgg ggttcgtttt 660 

gcgttgggta gttgcgttta aggagcgcgc gtatttagta tgagtttatt ttttatgggc 720 

gtttttagag gttcggtcgg gtcgcgtata ttaagcgtag taagtttttg tattaatggt 780 

tttcgcggtt gggcggggcg cggtcgttta aaggcgttgt ttttttggaa gttttgtttt 840 

tattttagcg tttgagagtt tgtagtcggt tgggaaggtt tttttggttc gtttggtttt 900 

ttcggggaag aggttaatat tcggtatacg cgatttacgg tagaggggag ttttgggcgc 960 

gtagaattgg ttgcgtttcg tcgagagttt tttgtgggtg gggagagttt ttttattttt 1020 

ttgttcgttt gagcgtttag agtttagggt cgtcgatcgt agtatttttc gatttgttgt 1080 

atcgagggtt cgcggttttt gtgtgcggtt tttatcgttg ttggaggcgg tcgtaggcga 1140 

atcgtcgggt cgttagttac gattcgagtt aggtattttt tcgtttttgg gatcggggtc 1200 

gaaggttaat tatattgtag ttaggttttg cgattggacg gtagtgagag tcgattggtc 1260 

gtcgtcgcga gtttcgggtt tttttttttt tttttttgtt ttaattttag cgtttaagtt 1320 

ttaggttaat gagatagcgt tttatagacg tttttttttc gttttttttt ttttattttg 1380 

gagagggagg ggaagttttg attggttaga ttgttttcgg aagttttttt ttttttatta 1440 

attatcgaag gaggtacgtt tttagcggag ttttttagtt aattataaag cggcggcggc 1500 

gtttagtcgt gtagttttat tagcgttttt gggttttatc gtttttaatt ttttaagttt 1560 

tttcgtaagg gtcggcgatt ttgattggtt attgttgtta ttgtcgaatg ttttttttag 1620 

ttagttatcg aataaggcga atttattttt ttcgttcggt tattttcggt atttttcgtt 1680 

ttttgggcgt ggttttttag cgttattttt taattggttt tttaggttga tcggtttttt 1740 

tcgggaggag tcgtaaataa acgacgttcg tgattggttt cgttcgggtt tcggttttta 1800 

gtcgaagcgg gcgagcgtgg ggttcggtcg gcgattttta gacgtttgtt acgcgggcgg 1860 

cggggcgttg ggcggtgtaa ggttgggtgg gggaggaagg aggtggagga cgagtaggag 1920 

gggggaggag gagtggggaa gtgtaaggcg gttgcgtaga tagcgttttt tatatagagt 1980

agtttttgat tcgggcgaat gcgggtttgt gtcgtcgtcg tcgtcgtcgt cgttcgggtt 2040 

aagtgataaa ggaaggaagg aagcgaggag gagtcggttt cgtagtcgtt gatagggttt 2100 

tgggttgggg taaagcgcgg atattttttg agcgggtatc gagtagagtc gaggggcggg 2160 

agggcggtcg agttgttgtc gcggacgggg gagggggttt cgagggacgg aagcggttgt 2220 

cgggttttta tgttttcggc gaatggggaa tagtcgagga gtcgttgttt ggggtttgaa 2280 

gggagttgtt ttcgttatcg ttatggtcgt tggatttagt cgtcgtttgt agttgttttt 2340 

ggcgtaatga ggagaggagt cgtcgttatc gttatcgttc gtttttgatt gattcgcgat 2400 

ttcgtcgttt tttagttcgt cgggtttttg tcgttagttc gtcggatttc gcggtttgtc 2460 

ggagttgtag 2470 

<210> 27 
<211> 2470 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 27 

ttgtagtttc ggtaagtcgc gggattcggc gggttgacgg taggggttcg gcgaattaga 60 

gggcggcgga gtcgcgagtt agttagaggc gggcggtggc ggtggcggcg gttttttttt 120 

ttattgcgtt aggagtagtt gtaggcggcg gttggattta gcggttatgg cggtggcgga 180 

ggtagttttt tttagatttt aggtagcggt ttttcgattg ttttttattc gtcggggata 240 

tgggaattcg gtaatcgttt tcgtttttcg gggttttttt tttcgttcgc ggtaatagtt 300 

cggtcgtttt ttcgtttttc ggttttgttc ggtgttcgtt taggaagtgt tcgcgttttg 360 

ttttagttta gagttttgtt agcggttgcg gggtcggttt tttttcgttt tttttttttt 420 

tttgttattt ggttcgggcg gcggcggcgg cggcggcggt ataagttcgt attcgttcgg 480 

gttaggagtt gttttgtgtg aggagcgttg tttgcgtagt cgttttgtat ttttttattt 540 

tttttttttt tttttattcg ttttttattt tttttttttt ttatttagtt ttatatcgtt 600 

tagcgtttcg tcgttcgcgt aataggcgtt tgggaatcgt cggtcgagtt ttacgttcgt 660 

tcgtttcggt tgggagtcga agttcgaacg gagttaatta cggacgtcgt ttgtttgcgg 720 

tttttttcgg aaaaagtcga ttagtttgag aaattaatta gaaagtgacg ttgggaagtt 780 

acgtttaaag agcggaaaat gtcgaaggta gtcgaacgga aagggtgaat tcgttttgtt 840 

cggtggttgg ttggaggaga tattcgataa tggtaatagt ggttaattag agtcgtcggt 900 

ttttacgaag aggtttgaag agttgaggac ggtgaaattt agagacgttg gtgaaattgt 960 

acggttgggc gtcgtcgtcg ttttgtgatt ggttggaaaa tttcgttggg agcgtatttt 1020 

tttcggtgat tggtgaagag aagggggttt tcgaggatag tttggttagt taggattttt 1080 



tttttttttt taaggtgggg agaagaaagc gaagggaggg cgtttataaa gcgttgtttt 1140

attggtttga agtttaggcg ttggagttgg ggtaaaaggg agagggaggg agttcgaaat 1200

tcgcggcggc ggttaatcgg tttttattgt cgtttaatcg taagatttag ttgtagtgtg 1260

attggttttc ggtttcggtt ttaggagcgg ggagatgttt gattcgggtc gtggttgacg 1320

attcgacggt tcgtttgcgg tcgtttttaa taacggtgga aatcgtatat agggatcgcg 1380

ggttttcggt gtagtaaatc ggaaagtgtt gcggtcggcg gttttgggtt ttgagcgttt 1440

aagcgagtag aggggtggag gggttttttt tatttatagg aggttttcgg cggggcgtag 1500

ttaattttgc gcgtttaagg tttttttttg tcgtggatcg cgtgtgtcga gtgttggttt 1560

ttttttcgaa agggttagac ggattaaggg ggttttttta gtcggttgta agtttttaga 1620

cgttaaaatg ggaataaagt ttttagaaag gtagcgtttt tgggcggtcg cgtttcgttt 1680

agtcgcgggg gttattgata tagaggtttg ttgcgtttgg tgtgcgcggt tcggtcgggt 1740

ttttggaaac gtttatgaga aatgagttta tgttggatgc gcgcgttttt tgggcgtaat 1800

tgtttagcgt aaaacgggtt tcgaaagtag gtgaggggaa atgtcgttag gtcggatatt 1860

ttgaagtttt cgaggaaatg ggggggttag aggagttcgt ggtagttttg gattttttgg 1920

aattagggtg atttataaaa gtgtagtttg agaagatttt atatataaga ttagcgttta 1980

ttgttttatt atagaattag tttttttggt ttgttttaga atagtgaggt atggttacgt 2040

taagataata gagaaagtag aatagatagt tgttattttg attgtaattt agtggtgttt 2100

gtaattttta tttgtagagt tatttttttt tgttaagatg ttgtttttag aaagtatatg 2160

ttaaaggatt tatatttgat tagattattt ggtttttgtt ttgaaatatt gatgaaggat 2220

ttgggataga ataaagtaag ttttgatgga agataggaaa ggtttagata gtgtttgtag 2280

atttaaatgg ttatagaggt tagataagta aagtaaatga gtgagattgg ttttgtgtaa 2340

gataatgggg aatgataggg aaaatggtaa attggaatat ttatgttata attaaaagga 2400

atagttgttt aattttagta gattgttgtt tttaggaatg tgggtttagt gtttttagat 2460

gtttaatttt 2470



<210> 28 
<211> 4721 
<212> DNA 

<213> Artificial Sequence 
<220> 



<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 28 



acgtgaagtt 
ttttattttt 
tgagaaaatt 
taagagaaaa 
tttttttagt 
attgttttga 
tttcgttttt 
tcggttcggt 
tttcgttttc 
ttttaggcgt 

cggggttgtt 

gggatttgcg 
taaaggaagg 
gagaagttta 
ttaatttata 
tgtttgtttg 
tcggtttttt 
tttgggttat 
tttttagttt 
ttcggttata 
ttgtattttt 
gtagtgttcg 
gaatgttttt 
gagggtatgt 
tttttggagg 
tttcggaaaa 
gtgatggggg 
tagttttttg 
gagggggagc 
gtttagtttg 
ggattttacg 
gtgcgtgagc 
cgggtgggga 
tttgttcggg 
tagttcgttt 
acggtttttt 



tagtattttt 
aatttaagat 
tatcgggtat 
taaaggttta 
ttaattgttt 
tatttagttg 
ttacgtttgc 
cggagtcgcg 
ggattttaga 
gtaatttatt 
tatttttagc 
ggcgttcggg 
aatggagaaa 
ggttttagcg 
gatttatttg 
tttttttgta 
atttaatttt 
tggtgttcga 
ttatagtacg 
tattttaacg 
ttgttgattc 
tagatttttt 
tgagtttttt 
atattttttt 
gatggttgta 
tgttgggggt 
ttgtagtata 
attttcgagg 
gagtaggacg 
tagtttaggc 
gtttggagta 
gtttttagat 
tggtggtgat 
tcgtcgttat 
ttttttttat 
tagagacgtc 



ttaagttttt 
aagtaggata 
ttggagtaag 
gttatgagaa 
gttttagttg 
ttagataaat 
ggtcgtagtc 
tattttatag 
tttagttttt 
tttgttttcg 
gtattcggga 
gtgaattttt 
gttgagagga 
tagtaagagg 
tgcggaagga 
ttttggggtt 
ttttgtttta 
cgtgattttt 
ttttttttag 
ttttttcggg 
ggaaatgaag 
cggggtaata 
tttgtttttt 
aggagggtat 
taaacgtatt 
agggggataa 
ttagttcggg 
aatgtttgag 
agggcggttg 
ggtcgggtag 
gtagttttag 
tcgtttcggc 
ttttggtatt 
cggttttatg 
ttttttttgc 
ggacgagcgt 



gtttatatat 
ttttaaaaat 
gagggggggg 
gagagataga 
tgggggttgg 
aaaagaggtc 
gtcgcgttcg 
tttatagggc 
agtaatttcg 
tcggttcgtt 
gatgtatttt 
atattatatt 
taaaaggaaa 
cgataacgtt 
aggaaaaagt 
ggtagagttg 
aaattttaaa 
cgtatttatt 
ttttttttgg 
tgcggggagg 
gcgtttaatt 
taggagaaaa 
ttttattttt 
gtatattttt 
gttttagggg 
ttttttattt 
gttgattttt 
attcggttat 
cgtaggttgc 
taggaggtac 
ttgtcgtttt 
gttttcggtc 
tcgtatcgtt 
cgttcggttg 
gttttttttt 
agatcgtttg 



aaaaagttac 
tatttattta 
agattttgaa 
aatttatgat 
aggtagggtt 
gtttggcgtt 
gttggttcgg 
ggggggcgat 
gtttttcgtt 
cggtattcgc 
aagcgtgttg 
ttgtgtatag 
ggagttttag 
attgaggcgt 
aggggatatt 
aaaggagata 
ggtgcgcggt 
aaaggagtaa 
cgggtttgga 
cgtgattttt 
tttaatcgta 
tttggaaggg 
agtttttagt 
ttaggagggt 
tttttttttt 
agttcgtaga 
tcgtaagatt 
ttgaatattt 
gcgtcggcgg 
gcggtgcgcg 
tttagttttt 
gcgtcggcgt 
ttgtttaggt 
ttcgtcgttt 
tcgttttcgt 
gttttgaata 



gttttagagg 
ttagttggtt 
attaaaaaag 
aatagaggta 
cgaggcggtg 
agtttcggcg 
ttggaaatta 
tatgtttttt 
tcggttcgtt 
gcgggggata 
gggaacgtgg 
agacgtagag 
ggttttagtc 
cggggtttta 
taattttttt 
gatatttagt 
tacgatagta 
atacggagat 
ttaaatttaa 
ttattttttt 
gattgtagga 
aatttttagg 
ttttttttag 
atgtatattt 
agtattggcg 
gtcggttaag 
tatagtaata 
tgtatagtag 
gcgttttcgg 
tttcgtcgcg 
tgagtatcga 
tcgcggttgg 
agtatttagt 
ttttttttcg 
cgtcgggcgt 
tgcggggcga 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2470 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



ggaggcgagg agaaaagtcg tttgtttgtt aaggagcgaa tatgattttc gtatattatg 2220 

aagaagtcgg gcgtcgagtt ggggtagtag gcgtaggcga tagtagtagt agtaggggtt 2280 

cgggtaggag cggcggcggt tcgaggggcg tttcgtggta tgcgttagtt tttcggaggt 2340 

cggggcgcgc gcgggggttc gggggcgttc gtcggggatc gggggtttgc gtttcggttg 2400 

ttttatttcg cgcggttcgc gttttgcgcg gtttttcggt ttcggcgcgt ttcggaggaa 2460 

ttcggtcgtc gtttttttgg ggacggtcga gtttgttttc gtcggcgttt ttttaaaaag 2520 

aggttttttc gtagtggttt tcgaatgtcg ggttcgttag tttcggtttt ttatatggaa 2580 

ggttcgcggg ggtaaaaaac gaaaggcgtt cggttgggtt gttggaagaa ggaaaaagtt 2640 

tttttttttt ttgttaagta atttaatttg ggggtgggga gaagtaggta attaaaaaaa 2700 

aaaaagtaag cgatttattt ttttttttta tatttttagt aatcggattt tttcgaattt 2760 

cgcgtatacg aagatttagg ggagggggtc gagtggattt tatttcgtat gagacgtttg 2820 

gtaaaataag aaggttttcg taaaatttaa taattaaata tgtaaagttt taaatgaaaa 2880 

ttattatttt ttcgaatttt agaggtttgg gggcgttcgg ttggaattgg ggtttaaaaa 2940 

aagaaaatgt ttataaagta taataagatg tttgatgggt ggaaaaatgt atttacgagt 3000 

tatatttttt cgtttttttg taaagtttcg ttggtttttt tttttttttt ttttgttaaa 3060 

aaaaaaaaaa aaaatcgtgt atttttttaa tttatagaaa gttttggtta gatcgtttta 3120 

attttgcgta tttgggtggt ttaggggagt ttttggtttt tttttttgcg tttttggggg 3180 

tttaggtttt cggcggggat tttttcgagg ttggcgcggg cgtaggggta gaagatgttg 3240 

cggcggcggt tgagttcggc ggggttgata gcgcggggga gggtggcgcg gcggcggcgg 3300 

agggttttag acgggtcgcg cgttttcgtt tttttcgggt ataagttgtt tgttagtgta 3360 

ggggtcgtcg atgttttttt ttttggtcgc ggttggtcgt cgaggttttt cgtatgggtt 3420 

gttcgtttcg atttagttgc ggcggtagga ggtttcggtg ttttttcggc gttttttttt 3480 

tcgagatttt tttcgtcgcg ttcgggagtt tttttgtttt cggttcgttt ttttttggcg 3540 

tttcggtttt tttgtcgtcg ttaggggttc gtcgcgtcgt atttaggggt ttaggggtcg 3600 

ggcgttcggc ggttcggcgt cgacggattg gttttgtttc gggtagtttt ttttcgcgcg 3660 

gcgagcggat cgagtacggc gttcggttgg ttcggttggc gcggttcggg gataggattt 3720 

ttcgtgcgtc gagtagtaag cgagtgtgtt cggggtttat cgtttttcgt aaggtttttc 3780 

gttttcgttt tttttttttt tttttttttt tttttttttt tttcgttttt agtcgtcgta 3840 

gtcgcgtcgt ttttttttcg ttttttcgta tttttttttc ggttttttgt tcggttggtt 3900 

ttattgcgta gtggcgcgtt cggtttcggt ttcgttacgt ggtcgtttag atattttgtc 3960 

gttttaaaaa aaaaaaaaaa gcgattgtgt ttcgtaaata atagatcggg tttttaaaag 4020 

ttattttttt ttttaatttt tcgttatcgt tatttttttt cgggtttgta gagggggtat 4080 

cgatggaggg gagagagata ggtggggggt agagaagttt ttagaatgga ttgagtttcg 4140 

gtcggagtta tggagaaatt ggaaaagtag ggagtatcga gcgggtttcg tcgcgagttt 4200 

tggagttgag cgagcgggtc ggtggttcga tttcgattcg gttgggtttc gcggtggtta 4260 

tttcgcgcgc gttcgtttta gcgttttatt tattggtttt gttttaaagg ttttggcggt 4320 

ggattttttg gtcgcgttcg agggtaaggg gaggagagcg ttgtttcggt ttaaaagata 4380

tttatatcgg attggatcga ggttttggga agtgtgcgtt gaggggaata gtcgtcgagg 4440 

gcggggaggc ggcgtgaata tgattttagc ggcggtcgcg cgtttttttt cgttttttta 4500 

gtttcgggtt tcggttttta ggattgtttg gagaagtgtg ttttgtgtat agttttggaa 4560 

tgtatttggt cggttgacga gttgtgaggg gtagtatttc ggcgggagaa ggggagcggg 4620 

ggtgggggtt cgtttgcgcg tcgcgggtag gttttttttc gggttcggaa gattttcgtt 4680 

attcgttatt ttgtttttcg gcgcgggaag gttatttagc g 4721 

<210> 29 
<211> 4721 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 29 

cgttgggtaa ttttttcgcg tcgggaggta gggtggcggg tggcggaggt ttttcgggtt 60 

cgggaggaaa tttgttcgcg gcgcgtaggc gagtttttat tttcgttttt tttttttcgt 120 

cgggatgttg ttttttatag ttcgttagtc ggttaaatgt attttagagt tgtgtataag 180 

atatattttt ttaggtagtt ttagaaatcg gagttcggag ttgagagggc gggagggagc 240 

gcgcggtcgt cgttgaggtt atatttacgt cgttttttcg ttttcggcgg ttgttttttt 300 

tagcgtatat tttttagggt ttcggtttag ttcggtataa atgtttttta aatcgagata 360 

gcgttttttt ttttttgttt tcgggcgcgg ttaagggatt tatcgttagg gtttttaaaa 420 

taaaattaat gaatgaagcg ttagggcgag cgcgcgcgag atggttatcg cgaaatttag 480 

tcgggtcgaa atcgggttat cgattcgttc gtttagtttt aaaattcgcg gcgaagttcg 540 

ttcggtgttt tttgtttttt taattttttt atggtttcgg tcggggttta atttattttg 600 

ggagtttttt tgttttttat ttattttttt tttttttatc ggtatttttt ttatagattc 660 

gggagggggt ggcggtggcg gggggttggg gggagaaata gtttttagaa attcgatttg 720 

ttgtttgcga aatataatcg tttttttttt ttttttaaag cgatagggtg tttagacggt 780 

tacgtgacga ggtcggagtc gggcgcgtta ttgcgtagtg gaattagtcg agtagagggt 840 

cggagggggg gtgcgggggg gcggggagga ggcggcgcgg ttgcggcggt tgggggcggg 900 

ggagggaagg gggaggaagg gggagggaag ggggcggggg cgggaggttt tgcgggaggc 960 

ggtgagtttc gggtatattc gtttgttgtt cggcgtacgg aagattttgt tttcgagtcg 1020 



cgttagtcga gttagtcggg cgtcgtgttc ggttcgttcg tcgcgcggga gagagttgtt 1080 

cgagatagag ttagttcgtc ggcgtcgagt cgtcgagcgt tcggtttttg agtttttgag 1140 

tgcggcgcgg cgagtttttg gcggcggtag aaggatcgga gcgttaggag agggcggatc 1200 

ggggataagg aggttttcgg gcgcgacgag gagagtttcg gaggaggagg cgtcgagagg 1260

atatcggggt tttttgtcgt cgtagttggg tcgaggcgag tagtttatgc ggggagtttc 1320 

ggcggttagt cgcggttagg ggaagggata tcggcggttt ttgtattagt aagtagtttg 1380

tgttcggggg gggcgagaac gcgcgattcg tttggggttt ttcgtcgtcg tcgcgttatt 1440 

ttttttcgcg ttgttagttt cgtcgggttt agtcgtcgtc gtagtatttt ttgtttttgc 1500 

gttcgcgtta gtttcgagga agttttcgtc gaggatttgg gtttttagga gcgtaggggg 1560 

aaagattaga gattttttta aattatttag atgcgtagga ttgaagcggt ttagttaaag 1620 

ttttttgtgg attaaaaaaa tatacgattt tttttttttt ttttggtaga agaaaaggag 1680 

aggaagatta gcggggtttt gtaaggaaac ggggggatgt aattcgtgga tatatttttt 1740 

tatttattaa atattttgtt atattttgta aatatttttt ttttttaaat tttagtttta 1800 

gtcggacgtt tttagatttt tgaggttcga ggaggtggtg gtttttattt ggggttttgt 1860 

atatttggtt gttaggtttt gcgagagttt ttttattttg ttagacgttt tatgcggggt 1920 

gaagtttatt cggttttttt ttttgagttt tcgtgtgcgc ggaattcgag gagattcggt 1980 

tattaaggat atagaggaaa aaaataaatc gtttgttttt ttttttttaa ttgtttgttt 2040 

ttttttattt ttaaattaag ttgtttagta agggggaaag aggttttttt ttttttttaa 2100 

tagtttagtc gaacgttttt cgttttttgt tttcgcggat tttttatgta ggaagtcgag 2160 

gttggcgagt tcgatattcg ggagttattg cgggggggtt tttttttggg gaggcgtcga 2220 

cgggggtagg ttcggtcgtt tttagggaag cggcggtcgg gttttttcgg ggcgcgtcgg 2280 

ggtcggagag tcgcgtaggg cgcgggtcgc gcggggtggg gtagtcggag cgtaggtttt 2340 

cgattttcgg cgggcgtttt cgggttttcg cgcgcgtttc ggttttcggg agattggcgt 2400

atgttacgga gcgtttttcg ggtcgtcgtc gtttttgttc gggtttttgt tgttgttgtt 2460 

gtcgtttgcg tttgttgttt taattcggcg ttcgattttt ttatggtgtg cggaggttat 2520 

gttcgttttt tagtaggtaa acgatttttt ttttcgtttt ttcgtttcgt atgtttagga 2580

ttaaacgatt tgcgttcgtt cggcgttttt ggaggagtcg tgcgttcggc ggcgaggacg 2640 

aggaggaggg cgtaggggga ggtggaggag gaggcgagtt gcggggagaa ggggcgacgg 2700 

atagtcgagc gtatggggtc ggtggcggcg gttcgggtag ggttggatgt tgtttgggta 2760 

aggcggtgcg aggtgttaaa ggttattatt atttttattc gttagtcgcg ggcgtcggcg 2820 

cggtcggggg cgtcgaggcg gatttgaagg cgtttacgta ttcggtgttt aagaaattga 2880 

^gga-gcggta gttggagttg ttgttttagg tcgtggagtt tcgcggcggg acgcgtatcg 2940

cgtgtttttt gttgttcggt cgtttggatt gtaggttggg ttcgggggcg ttcgtcggcg 3000 

cgtagtttgc gtagtcgttt tcgttttatt cgtttttttt tttgttgtgt aaagtgttta 3060 

ggtggtcgga ttttaggtat ttttcggaag ttaagaggtt gtgttgttgt gaattttacg 3120 

ggaagattaa tttcgagttg gtgtgttgta atttttatta ttttagtcga ttttgcgaat 3180 

taggtgagaa gttgtttttt tatttttaat atttttcgga gcgttagtgt tggaagaaaa 3240 

gtttttaggg tagtgcgttt gtgtagttat ttttttaggg ggagtgtgta tatttttttg 3300 

ggagagtgtg tatatttttt tgggagagtg tgtatatttt tttgggaaga gattgggggt 3360

tgaaagtaga gaagggataa ggggaaattt aaagggtatt ttttgggggt ttttttttag 3420 

gttttttttt gtgttgtttc gaaggggttt acgggtattg tttttgtagt ttgcggttaa 3480 

aaattagacg tttttatttt cggattaata agggggtgta gagggaggtg ggggggttac 3540 

gttttttcgt attcggaaag acgttgaggt gtgtggtcga gttggatttg gtttaggttc 3600 

gttaaggaag gttgggagag gcgtgttata aaggttggaa ggttttcgta tttatttttt 3660 

tagtaaatac ggagaattac gtcgaatatt agtggtttag atattgtcgt ggtcgcgtat 3720 

ttttggagtt ttggggtaaa gagagttgga tggaaggtcg aattgggtat ttgttttttt 3780

ttaattttgt tagttttagg gtgtagagga ataaataggt aggagaagtt gggtgttttt 3840

tatttttttt tttttttcgt atagataagt ttgtagatta gtggggtttc ggcgttttag 3900 

tgacgttgtc gttttttatt gcgttggggt ttgggttttt cggttgagat tttgaaattt 3960

tttttttttt atttttttag ttttttttat tttttttttt attttacgtt tttgtatata 4020 

gagtgtgatg tgagggttta tttcgggcgt tcgtagattt tttacgtttt ttagtacgtt 4080 

tgggatgtat ttttcgagtg cgttaggaat gggtagtttc gtgtttttcg cgcgagtgtc 4140 

gagcggatcg gcgggggtag aggtgaattg tacgtttggg gggcgggtcg gggcggggag 4200 

tcgaggttgt tgggagttgg gtttggagtt cgaggacggg gagggggtat ggtcgttttt 4260 

cgttttgtga attgtgaagt gcgcggtttc ggtcgagtcg gtggttttta gtcgggttag 4320 

tcgagcgcgg cggttgcggt cgtagacgtg gaggagcgga gcgtcgaggt tggcgttagg 4380

cggttttttt tgtttatttg atagttaaat attaaggtaa ttatcgtttc gggttttgtt 4440 

tttaattttt atagttaaga taaatagttg ggttaaaaga atgtttttgt tattataagt 4500 

ttttattttt ttttttatgg ttgggttttt attttttttt gtttttttaa ttttagagtt 4560 

tttttttttt ttttatttta gatattcgat ggattttttt aaattaattg gtgagtagat 4620 

gatttttaaa atattttgtt tattttaagt tagaaataga gtttttagaa cgtggttttt 4680 

tgtgtgtagg taaaggtttg gggaggtgtt gggttttacg t 4721 

<210> 30 
<211> 2494 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens)



<400> 30 



tttttatgaa gttataaagt tgtaatttcg tttttatgtg tagtattata ggttttgttt 60 

ttaaatattt gtaggtgtat ttttttgtat tcgttaaatt aagggttttt tttttttgtg 120 

ttagattcgt ttttacgatt acgtttttta tttgatttat agaaagggtt ttacgatcgt 180 

taggttatcg tcgtttttgt attttattgg ggagtcgaag gttttagcgt tgttcgcgat 240 

ttttttaagg tggtcgggaa ggttttaatt acgttttttt gtaaattaga taaggaaggt 300 

ttggggattt cggtcgcgga aaggtataga ggaggtgaac gtttcggatg tttaagtgac 360

gtatttgtag gcggattcgg aaattcgtgt ttttgagtcg gaggcggagt ttttagattt 420 

agatcgattt tttagaatcg ggcggttttg ttcgaggtat gaaggttttt taggttcgtt 480 

tttaaaagtt gtgtataatg gagatttttg gaataggtat atatttttat tgtaattaga 540 

acgagatgta atatttaaga cgtagggtag ggttttggta aagatacgtt gggggaacgt 600 

gaaagaaagg gttaaaatta cgtagttggt ttttagagat taaagattac gatttttagt 660 

atttaatgta gtgaattttt aggacgttgt atgtcgggag ttgtagtttt ttattatcgt 720 

tattttttcg ttttttcgtt atttgaaagg gttttagggg atttgtaatt tttttcgtgt 780

gttttttttt ttcgagaagc gtcgttatac gagaaagttg gtcgcgaaag tcgtgttgga 840 

attattttta acgaaatttt aggtatagat gggaaagggt gaagaatacg ttgttatggt 900 

tatcgttttt tcggttacgg aataaacgtt ttttaggatt cggaagtagt ttcgtcgcga 960 

tttttttaaa aggatggatg tgttttttgt ttatatttat tggacgtttt tttttagagg 1020 

ttaaggtcgt ttaggtaaag gggcggtttt acgtgtgagg ggttcgcgga gttatttgat 1080 

tggagaaaag ttgtaaattt tgattaatcg gaaggagtta cgtttcgggt atcggttatc 1140 

gtatttggat agtttcgatt ggtggatttt cgtttttttt acgaattttt attgggtgtc 1200

gtgggtgcgt ggtgcggcgc gattggtggg tttatgtttt tcgtttttcg ttcgcgagaa 1260 

gtgggggtga aaagcggttc gatttgtttg gggtgtagtg ggcggatcgc gcggttggag 1320 

gtgtgaggat tcgaatttag gggtggaggc ggtttttgcg atcgaagggg atttgagatt 1380 

tatcggtcgt acgttatgag ggttttgtgg gtgttgggtt tttgttgcgt tttgttgatt 1440 

ttcggtgagt gattttggag gagtagacgt ttttttttta tatacgcggt cgtttttcga 1500 

aggttttggg ggcgttgaac gtgggagggg ggatttcggg gtttgcggtg ggttaaggga 1560 

cgttattatt tttcgaaaga gggtaaggga attacgttta tttttttttt tttttggaaa 1620 

taaaaaaaga gagtaatatt atttaattgt tttatagatt taatagtatt tcgttcggag 1680 

gttttagcga tttcggggtg gggttttttt gaggttttgg ttttttcggg agttgtgcga 1740 

gttttttttt ttttttcgtt cggaggattt tttcggttat cgtaattttc ggttatttta 1800 

gttttttcgt tttttaattt tttagatttc gatttttcgt aagttcgtgt ttttttttag 1860 

atgttggttt cgggagcgtg aggtttgggg ttagcggaat ggtagtgtta gcgtgtttga 1920 

tttagttttg ttggagttcg gtttttaata tttttttttt tttttatgtt tttgggttta 1980 

attttagttt ttttcgttag aggttgtata aattttattt gtattttgat ttgtgttaac 2040 

gaaggtcgaa gttttggttt gattagttgg gaggttttgg ttcggtttgc gcgtaatacg 2100 

tgtgggtgta gaagaggttt ggtattagtt tgtttaacgt tatttataaa tgatcggaga 2160 

tatattgttt agaagtggtg attgttttcg tattttagga tttattggag attttagtat 2220 

ttgaatttaa atagatgttt tgttttggtt ttatttttgg tatttaaatt agggaatgta 2280 

tttttttatt tttatttttt ttagttatta attgaatttt gtaattttta ttttttttta 2340 

tttgattatt ttttttggag aagtgtgatg ttgaagagtt tgttcgtgtt aaatttttag 2400 

ggtcggttag agttgacgat gaagttgatg tggatggtat agtagaagag gatttgggta 2460 

aaagtagaga aggattaagg acggatgatg aagt 2494 

<210> 31 
<211> 2494 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 31 

attttattat tcgtttttga ttttttttta tttttattta gatttttttt tattgtatta 60 

tttatattaa ttttatcgtt agttttgatc gattttgaag atttaatacg ggtaaatttt 120 

ttaatattat atttttttaa ggaggatgat taaataggga agggtagggg ttgtaaaatt 180 

tagttggtaa ttaagggagg tggggatgaa agagtgtatt ttttggttta aatattaaag 240 

gtaaagttaa gatagaatat ttatttaagt ttagatgttg aggtttttag tagattttga 300 

aatacggagg tagttattat ttttaaatag tgtgttttcg attatttata aataacgtta 360 

agtagattga tattaggttt tttttatatt tatacgtgtt gcgcgtaggt cgagttaagg 420

ttttttaatt gattaagtta aaatttcggt tttcgttagt ataggttaga atgtaaatag 480

gatttatgta atttttggcg aggggaattg aaattgagtt tagaggtatg agaaggggga 540 

gaaatgttaa gaatcgaatt ttagtaaaat tgaattaaat acgttgatat tgttatttcg 600 

ttggttttag gttttacgtt ttcgaggtta gtatttaaga agggatacgg gtttgcgagg 660 

agtcggagtt tggggaatta ggggacggga aagttgggat ggtcgaaggt tgcggtggtc 720

gaaaaagttt ttcgggcggg gaggggggag gggttcgtat agttttcggg ggagttagag 780 

ttttagagag gttttatttc gaggtcgttg agattttcgg gcgagatatt attagatttg 840 

tagggtagtt agatgatgtt gttttttttt tttattttta gaggaagaag agaataaacg 900 

taattttttt gttttttttc gaggaatggt gacgtttttt ggtttatcgt aggtttcggg 960 



attttttttt ttacgtttaa cgtttttagg attttcgaga agcggtcgcg tgtgtggagg 1020 

ggggacgttt gtttttttag aattatttat cgaaggttag taggacgtag tagaggttta 1080 

gtatttatag ggtttttatg gcgtgcggtc ggtgagtttt aagttttttt cgatcgtagg 1140 

agtcgttttt atttttgggt tcggattttt atatttttag tcgcgcggtt cgtttattat 1200 

attttaagta ggtcgggtcg ttttttattt ttatttttcg cgggcggggg acgggaaata 1260 

tgaatttatt aatcgcgtcg tattacgtat ttacggtatt taatgaggat tcgtgagggg 1320 

ggcggaagtt tattaatcgg agttgtttag gtgcggtgat cgatgttcga agcgtggttt 1380

ttttcgattg gttagggttt gtagtttttt tttaattaaa tggtttcgcg ggttttttat 1440 

acgtgggatc gtttttttgt ttgggcggtt ttggttttta agggaaaacg tttaatgaat 1500 

gtaagtagag aatatattta cttttttaga gaggtcgcgg cggaattatt ttcggatttt 1560 

agagagcgtt tatttcgtga tcggggaaac ggtagttatg gtaacgtgtt ttttattttt 1620 

ttttatttat gtttggggtt tcgttggaag tgattttagt acgattttcg cggttagttt 1680 

tttcgtgtgg cggcgttttt cggaaagaag aaatatacga gagaggttgt aaatttttta 1740 

gaattttttt agatggcggg ggggcggggg ggtggcggtg gtgagaaatt atagttttcg 1800 

gtatgtaacg ttttaggggt ttattgtatt gaatgttggg ggtcgtagtt tttggttttt 1860 

gaggattagt tgcgtagttt tggttttttt ttttacgttt ttttagcgtg tttttgttag 1920 

agttttgttt tgcgttttag atgttatatt tcgttttgat tatagtagag atatgtgttt 1980 

attttaggag tttttattgt atatagtttt tagaagcggg tttagaaagt ttttatgttt 2040 

cgggtaaggt cgttcggttt tggggggtcg gtttgagttt gggggtttcg ttttcggttt 2100 

aagagtacga gttttcgaat tcgtttataa gtgcgttatt tgggtattcg gagcgtttat 2160 

tttttttgtg ttttttcgcg gtcggaattt ttaggttttt tttatttggt ttatagaaga 2220 

acgtggttgg gattttttcg attattttag ggaaatcgcg agtagcgttg aggttttcga 2280 

ttttttaata aagtataaag gcggcggtga tttgacggtc gtagaatttt ttttgtgaat 2340 

taggtggaga gcgtggtcgt aaaagcgggt ttagtataga aaaggaaaat ttttagttta 2400 

acgagtgtag gaggatgtat ttgtagatat ttgaaagtaa aatttgtggt gttgtatatg 2460 

agaacgggat tgtagttttg taattttata gaaa 2494 

<210> 32 
<211> 2497 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 32 

tgaagattgt gtggagtggg aggtttgtgt ttattttttt tatttattgg ggaatgggtt 60 
gagtggttgg gggtaaagtg ggatttaggg gttttgttta gtttttattg taggttaagt 120 
ttttttttgc ggttaagagt tattgggttt tttatttttt tttggttttt attcgtagat 180 
ttagatggtt ttatggtgat ttagaggagg atgtattcgg gcgttttttt tgaggatggt 240

tgtagggatt atttagggat tgagtcgagg tgtagatata gttacggatt tgggtgttgt 300

tagatataga agggtgtgta gtaaagtgag ttttacgtat gcgtagattt ttttttcggt 360

ttttgttttt tatggaaata taattgatta tattatgatt gattacggta gttttttagg 420 
gttttttttt tttttttttt tttttttttt agaagttata aagggatata gaatatattt 480

aaagattatt ttatgttttt agtggagagg taaaaaaggt tcgtttcgga atggggaagc 540 
ggttgcgttt tggacggaga ggggcgggga tttcgcgatt gtaggcggag ggagggcggg 600 
tgtcgttggc gtaggcggtg atagggagat atcgtcgtta ttgagtattt ttatgtaagt 660

ttttttattt ttttgtgtat tagtgtttat attggggtaa tgataaatgt tgtgttgaaa 720

aattatttga tggggttatg gaaggaacgg aaggaacggc gttttggttc gttcggggtt 780

cgcgtacgtc gttattaagt cgcgggggcg ggtcggaggg gagagttgcg ttagttaggt 840 
cgttgttaga tgacgagttc ggggcgtgac ggggtggagt atttttaaaa aagtgtatgt 900 
ttaggatttc gtttagtgta tttttgcgcg cggcgggtcg ggttgggtag ttttataaat 960 

agtcgtggtg tgagtttcga agggaattat tagcgttttt tgtttacgga gttttaggtt 1020

tataatggta gcggtcgtta gtttcgcgtt ttttttgtgt ttttcgtttt tgtatttgtt 1080 

gtttggttgg tttcgggtag gatgggtcgg tgagttcggg gatgtagttt aagtagggcg 1140 

ggggttaaat ttgggaggtt gtggattgta gcgggtttta gaggagggga ggtttttgga 1200

aggatcggcg cgattttttt gaacgaatat cgcggttttt tcgaacgtcg cggttttttc 1260 

gaacgtcgcg gttttttcga atatcgcggt gttttcgaat atcgttgttt tttcgaatat 1320 

cgcgattttt tcgaatatcg tgattttttt agatatgttt agttgaaggt atttagtttt 1380 

tttcggtggt tttttttcgt cgggttcgtt ttttgcggtt gttgtttgtt ttttaggtta 1440 

ggaggttttt ggaaggatcg gtgttgtttt ttcgaatatc gtggtttttt cgaatatcgc 1500 

ggttttttcg aatatcgttt ttttttcgag taacgcgatt ttttcgaata tcgcggtttt 1560

ttcgaaaatc gcgatttttt cgaatattgt tattttatcg aatatcgcga tttcgtcgaa 1620 

tatgttcggt tgaaggtatt tagttttttt tcgcggtttt ttttcgtcgg gtttgatttt 1680 

tgcggttgtt gtttgtttcg taggttagga ggtttttggt agtatcggcg cgatgttttc 1740 

gaatatcgcg ttttatttta atatcgcgat tttttcgaat atcgtgattt ttttcgaata 1800 

tcgtcgtttt ttcgagtaac gcggtttttt cgaatatcgc ggttttttcg aatatcgcgg 1860 

tattttcgaa tatcgtcgtt ttttcgtata ttgcgatttt tcgaaatatt gcgatttttt 1920 

cgaatatcgc gatttcgtcg aatatgttcg gttgaaggta tttagttttt tttcgcggtt 1980 

tttttttttc gggttcgttt tttgcggttg ttgtttgttt tataggttag gaggtttttg 2040 



ggtggattag cgcgattttt tcgaatatcg cggtttattc gaatatcgcg gttttttcga 2100 

atatcgcggt tttttcgaat atcgcgattt tttagaatat cgcggttttt tcgaatatcg 2160 

cggttttttc gaatatcgcg attttttaga atatcgcggt ttattcgaat atcgcggttt 2220 

tttcgaatat cgcggttttt tcgaatatcg cgatttttta gaatatcgcg gttttttcga 2280 

atatcgttgt tttttcgaac gtgtttggtt gaaggtattt agtttttttt cggggttttt 2340 

tttcgtcgag ttcgtttttt gtagttgttg ttagtatcgt agtttagggg gagtgttaaa 2400 

gaaggttgaa aaggaattgt aggagggtgg agggattaaa aggttataga gggtaaggta 2460 

gggcggggat ttttggtgta gattcgtagt tttattg 2497 

<210> 33 
<211> 2497 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 33 

tagtggggtt gcgggtttgt attagggatt ttcgttttat tttgtttttt gtagtttttt 60 

ggttttttta tttttttgta attttttttt agtttttttt gatatttttt ttggattgcg 120 

gtgttagtag tagttgtagg aagcggattc ggcggaaagg agtttcggag gggaattgag 180 

tgtttttagt taggtacgtt cggggagata gcgatgttcg gggagatcgc gatgttttgg 240 

gggatcgcga tgttcgggga gatcgcgatg ttcggggagg tcgcgatgtt cgggtagatc 300 

gcgatgtttt gggggatcgc gatgttcggg gagatcgcga tgttcgggga ggtcgcgatg 360 

ttttggggga tcgcgatgtt cggggagatc gcgatgttcg gggaggtcgc gatgttcggg 420 

tagatcgcga tattcgggga gatcgcgttg gtttatttag aagttttttg gtttatgggg 480 

taagtagtag tcgtaggaag cggattcgga ggaaaggagt cgcggagggg aattgagtgt 540 

ttttagtcgg gtatgttcgg cgagatcgcg atgttcgggg agatcgtaat gtttcggggg 600 

atcgtaatgt acggggagac ggcgatgttc gggggtatcg cgatgttcgg ggggatcgcg 660 

atgttcgggg agatcgcgtt attcgggggg acggcgatgt tcggggggga ttacgatgtt 720 

cggagggatc gcgatgttgg ggtagaacgc gatgttcggg ggtatcgcgt cggtgttatt 780 

agaagttttt tggtttgcgg ggtaagtagt agtcgtagga attagattcg gcggaaagga 840 

gtcgcggagg ggaattgagt gtttttagtc gggtatgttc ggcgagatcg cgatgttcgg 900 

tgagatggta atgttcgggg agatcgcgat tttcggggag atcgcgatgt tcggggagat 960 

cgcgttgttc ggagagaggg cgatgttcgg agaggtcgcg atgttcgggg agattacgat 1020 

gttcggggag atagtatcgg tttttttaga aattttttgg tttgaggggt aagtagtagt 1080 

cgtaggaagc ggattcggcg gaaaggagtt atcgagggga attgagtgtt tttagttggg 1140 

tatgtttggg gagattacga tgttcgggga gatcgcgatg ttcggggaga tagcgatgtt 1200 

cgggggtatc gcgatgttcg gggagatcgc gacgttcgga gggatcgcga cgttcgggga 1260 

gatcgcgatg ttcgtttagg gagatcgcgt cggttttttt agaagttttt ttttttttga 1320 

aattcgttgt agtttataat tttttaggtt tggttttcgt tttgtttagg ttatattttc 1380 

gaatttatcg atttattttg ttcgggatta gttagatagt aggtgtagaa gcgggaggta 1440 

tagaaggaac gcggggttgg cggtcgttgt tattgtagat ttggagtttc gtggatagga 1500 

ggcgttgatg gtttttttcg aggtttatat tacggttgtt tataaagttg tttagttcgg 1560 

ttcgtcgcgc gtagggatat attgggcggg attttaggta tgtatttttt tggggatgtt 1620 

ttatttcgtt acgtttcggg ttcgttattt gatagcggtt tggttgacgt aatttttttt 1680 

ttcgattcgt tttcgcggtt tggtggcggc gtgcgcgggt ttcgagcggg ttaggacgtc 1740 

gtttttttcg ttttttttat ggttttatta aataattttt taatatagta tttattatta 1800 

ttttagtgta aatattgatg tataggaaga tgaagaaatt tgtataggaa tatttagtgg 1860 

cggcggtgtt tttttgttat cgtttgcgtt agcgatattc gttttttttt cgtttgtagt 1920 

cgcgaagttt tcgttttttt tcgtttaggg cgtagtcgtt tttttatttc ggagcgaatt 1980 

ttttttgttt ttttattaaa agtatgaggt ggtttttgag tatattttat atttttttat 2040 

gatttttgga aaaaaaaaaa aagaaaagaa agaaagtttt agagggttat cgtgattaat 2100 

tatgatgtga ttagttgtgt ttttatggag agtagaagtc gagggaagag tttacgtatg 2160 

cgtggggttt attttgttat atattttttt gtgtttaata atatttaaat tcgtggttgt 2220 

atttgtattt cggtttagtt tttggatggt ttttgtagtt atttttaaaa gggacgttcg 2280 

agtgtatttt tttttgagtt attatgaaat tatttagatt tgcgggtaga agttagaggg 2340 

aggtggaggg tttagtggtt tttggtcgta agggagggtt tagtttgtag taaggattgg 2400 

gtagagtttt tgagttttat tttgttttta gttatttagt ttatttttta atgaatgaga 2460 

aaaataggta tagatttttt attttatata gttttta 2497 

<210> 34 
<211> 3095 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 34 



ggttattata ttcggtagcg gggtggtttt tagtttttgt tgttttttcg tttttttttt 60 

tcgttcgttt tcggagttta gtcgattttt gaggttttaa ttttatttat ttttttttcg 120 

ggtcgtcgtc gtcgcgtttt tttttatttt tattttttcg aggagagtta taggttgtaa 180 

atttaattaa tttcgtaatt tatttttgta aaattattta taaagatttt tttttcgcgt 240 

tcgcgttcgt ttttttcgcg tcgggttttt ttagttacgg ttataaagtg tttttttttt 300 

tttttgagtt ttgtatataa ggaacgcggg ttggggtttt gttcgttttt tttttcgttt 360 

aaggtaagga tttcgggaat ttgaagtttg gcgtttatta cgtttaggtt cgtagttttt 420

tttttataga gtttgtatta tgggaaaaaa taaaataaaa tttaggaaag ggaggtaata 480 

gttattggga gttaatatag agttacgtag cgtttaaaat ataaatatcg tagcggttag 540 

aaatttcgtt atttttttcg tttttttagg ttgttttgtc gaggtttttt gagttttttc 600 

gtatattgaa aggtatcgta ggtgtagtgc gtattttttt tttatttatt ttaagaagtt 660 

ttgtttcgtt attagttttt tttttcggga tgagtaggga gagcgcgcgg aggttttcga 720 

ttttttcgat tataattaag aaagaataat ttttaaagtg tttaatattt tcgtttttaa 780 

gttttttaaa atataggggt agggaatatt aaaatattcg gtttttatta ggaagattac 840 

ggttttgaaa ggaaatagta gatacgatat tttattttat ttggatttat gattaaaaaa 900 

ataaaaataa aaatttaaag agttcgtttg tatttttttt ttttaaattt cggttcggtt 960 

cgaaggtagg gaatttaaaa gatcgaggtc gatggaagag agttagcggg gcgagcgagc 1020 

gggtagtttt tttttttgtt tttcggagtt atttagaagg ataggggaag ggaaggaaga 1080 

agaggcgagg aaaaagagga gggagggaag cggaggttag gagcgacgga gtaaggaaag 1140 

tagtttgtaa gcgagaaaag agggaaaaaa tatagtcgta cgaatttaga gagattataa 1200

gtcgtacgta agtagtagta gaaagagcga gagcgcgagc gcgcgttttt ttcgcggttt 1260 

ggggttagat agtttttaga ttagttcgaa ttatttttta agtattgttt cgtttttttt 1320 

gtttcggtcg ttttttaatt tttttttttt ttttttttta tttttttttt aaaaattaaa 1380 

ataatataag ggagggtggt aaaagttttt ttaaatcggt cgatttattt aaagataata 1440 

ataataataa taaatatata ataatttata ttttatggtg ggagagacgt gggattaatt 1500 

ttcggtattt attttaatat ttgatagtta gaataaataa atatatatat ttatattaat 1560 

agatatatat agaaaatttg gagttaaagt atttggtaag agcggaaaaa aaaagaatta 1620 

aaaggtaaaa taatgattat gagtagcggc ggcggtagcg gtattagcgg taatagcggc 1680 

ggcggcggta gtagtagtag tagcggcggt agtaatagta ataattattt ggtgttcggt 1740 

tttttttaga aattttttgt attattattt ttaagaattt tagttttaag aattaataga 1800 

gtttaatttt cggaatttga gtttcggatt ttattattgt tacgtggtag gggaggattt 1860 

ggtgttagtt tttcgagatt tttattgttt ttggttaatt aaaagttttt aaagttataa 1920 

gattttttta ttggtcggta tatttcgagg tttttataag tagagcgttt cggatttgga 1980 

ggtttcggtt cgaggttcga ggggtttgaa ggtggttttt ttttttcggg tttaagacga 2040 

tggtatggtt tgtttcgtta ttattacgtg ggtttttttt ttgtgacgtc ggcgttttcg 2100 

ttgtagtaaa gttcggtttt tggaattttg agaattaatt tgttattcgg tgatataaga 2160 

gggggagtgc gttttgtttt ttcggggttt ggggttaatt tttttttttt tatttataaa 2220 

tttagtagat cgagttaaat gtataaaagg gagcgagagg tttgaattat tgggaaaagt 2280

atgttatata tatagtaggg ttagagaggc gagtaagaga aaaataaaat aaaataaata 2340

ttatagtttt ttttaattag aatattaggt attacgagaa aaatatttgt taagtagttt 2400 

tcggtgggtt tatttgtttt atttttattt aggatagggg tttttgttgt tgttttgggt 2460 

tttttttttt ttggtgtggt ggtttgggat ttttggtttt tgtattttga tggtttatgg 2520 

atttttgttt ttgatttttt gttttttgta agtttgtggt gttacgtaaa ttataggatc 2580 

ggtatcggtt ggattttttt gtacgtgttt tttttttttt tatttaattt tttaagcgtt 2640 

ttaaagatgt attattttaa tattaatatt attgaaagaa gtttaaattt ttggttatat 2700 

gtaataattt tagtttttat tttttttttt tttttttttt ttggtgtaat tttttttttt 2760 

ttttttggat ttttgttgaa gtgtgttttt tttgtatttt agagaaatgt ttaaaggatt 2820 

tgttttggtt tggtttgttt ttttttagga tagtaagtgg tgggtttaat ttgttattgt 2880 

tgatttttgg gaaatttttt gttgtaagaa acgtgtgtgt gggggggagg gtgggggtgg 2940 

cggggtggta tgtgtgtgtt ttttataaaa ttttgtgagt taaatatttg tttgtgtttt 3000 

gttttttttt aaggttttga gatttttgtt ttcgaggttc gttttaaggt cgttgtaaaa 3060 

aaattttttt agtttgtgtt taagagatta gtcgg 3095 

<210> 35 
<211> 3095 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 35 

tcggttgatt ttttaaatat agattgaaga gattttttta taacgatttt gaaacgagtt 60 

tcgaaaataa aaattttaag attttaagag aaaataaaat ataaataggt atttggttta 120 

tagaattttg tagaaaatat atatatatta tttcgttatt tttatttttt tttttatata 180 

tacgtttttt gtaataagaa attttttaag agttaataat aatagattaa atttattatt 240 

tgttgttttg gaaagaaata aattaaatta aaataaattt tttgaatatt tttttgaagt 300 

gtaggagaga tatattttag taaaagttta agggggaaaa agaaaattgt attaaaggaa 360 

aaaaaaaaaa aaaaagtggg ggttgggatt gttatatatg gttaaaaatt taagtttttt 420 



ttaatagtat tagtattgaa ataatatatt tttaaaacgt ttgagggatt agatagggaa 480

agaaaaggta cgtataaaaa aatttaatcg atgtcgattt tgtgatttac gtaatattat 540

aaatttgtaa aaggtaaaaa attagaagta aaaatttata aattattaaa atatagaaat 600

taaaaatttt aagttattat attagaaaga aaaaaattta gaataatagt aaaaattttt 660

gttttaaata aaaataaagt aaatgaattt atcgaaaatt gtttggtaaa tatttttttc 720

gtggtgttta atattttagt tggaaagagt tgtgatgttt attttatttt attttttttt 780

tattcgtttt tttaatttta ttatatatat aatatatttt ttttagtggt ttaaattttt 840

cgtttttttt tgtgtattta gttcgatttg ttgagtttat gggtaagaaa gaaggaatta 900

gttttagatt tcgggaaagt aaagcgtatt ttttttttta tgttatcgaa tagtaaatta 960

gtttttagaa ttttagaggt cgagttttgt tatagcgaag gcgtcgacgt tatagaggag 1020

gagtttacgt gatggtggcg gagtaggtta tattatcgtt ttgggttcgg ggagggagag 1080

ttatttttag gtttttcgag tttcgaatcg gaatttttaa attcgagacg ttttgtttat 1140

gaggatttcg aaatatgtcg gttagtgaaa aaattttgtg gttttgaggg tttttggttg 1200

gttaggggta gtaaaaattt cggagagttg atattaagtt tttttttgtt acgtagtagt 1260

ggtaaagttc gaagtttaaa tttcgagaat tgagttttgt tgatttttag aattggggtt 1320

tttagaagtg gtgatgtaag aagtttttag gaaaggtcgg atattaggtg attattgttg 1380

ttgttgtcgt cgttgttgtt gttattgtcg tcgtcgtcgt tgttgtcgtt ggtgtcgttg 1440

tcgtcgtcgt tgtttatgat tattatttta ttttttaatt tttttttttt tcgtttttgt 1500

taaatgtttt ggttttaagt tttttatgtg tatttattga tataaatgta tatatttatt 1560

tattttagtt gttaggtgtt aaaataaatg tcgaagatta gttttacgtt ttttttatta 1620

taggatatag attgttatgt atttattatt attattgttg tttttgagtg aatcggtcgg 1680

tttggggagg tttttgttat ttttttttgt gttgttttgg tttttggaaa ggaggtggag 1740

gagaggaagg aggggaatta gggggcggtc ggagtagaga ggacgagata gtgtttgggg 1800

ggtgattcgg gttagtttgg gggttgtttg gttttagatc gcggagagga cgcgcgttcg 1860

cgttttcgtt ttttttgttg ttgtttgcgt acggtttgtg atttttttgg attcgtgcgg 1920

ttgtgttttt tttttttttt ttcgtttgta aattgttttt tttgtttcgt cgtttttggt 1980

tttcgttttt tttttttttt ttttttttcg tttttttttt tttttttttt tttgtttttt 2040

tgggtaattt cgggaggtaa aaagggaggt tgttcgttcg ttcgtttcgt tggttttttt 2100

ttatcggttt cggtttttta gattttttgt tttcgagtcg aatcgagatt tggaaggaaa 2160

aaatgtaagc gaattttttg ggtttttgtt tttgtttttt tggttataaa tttagatgag 2220

atgaagtatc gtgtttatta ttttttttta gagtcgtgat ttttttaatg agagtcgagt 2280

gttttggtgt tttttgtttt tgtgttttgg ggagtttggg ggcggggatg ttgaatattt 2340

tgaaaattat ttttttttgg ttgtagtcga gggagtcggg aattttcgcg cgtttttttt 2400

gtttatttcg aggagagaga ttgatggcgg gatagggttt tttggggtgg gtgggaaagg 2460

ggtgcgtatt gtatttgcga tgttttttag tgtgcggggg gatttaggga atttcgatag 2520

gatagtttgg gagaacgaga aaggtggcgg gatttttggt cgttgcggtg tttgtatttt 2580

gggcgttgcg tgattttgtg ttggttttta atggttgttg tttttttttt ttaaatttta 2640

ttttattttt ttttatggtg taagttttgt aaaaagggaa ttgcgggttt gagcgtagtg 2700

gacgttaggt tttagatttt cgaggttttt attttgggcg aggagaaaga cgaatagagt 2760

tttagttcgc gttttttatg tgtaagattt aggaggagag aagggtattt tgtggtcgtg 2820

gttgagggga ttcggcgcgg gaggagcggg cgcgggcgcg aaagggagat ttttgtgagt 2880

gattttgtaa aaatagattg cgaggttggt tggatttgta atttgtggtt tttttcgagg 2940

gagtaagaat gggggaaggc gcggcggcgg cggttcgggg agggagtggg tagagttgga 3000

gttttagaaa tcggttgagt ttcgggggcg ggcggggaga aagggcgggg gggtagtagg 3060

agttaggggt tatttcgttg tcggatgtag tgatt 3095



<210> 36 
<211> 2229 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 36 



ttttttttcg gcgttggttg gtgcgggttg gggttaggtg gagaagtcgt tttttgttaa 60

ggtgatagaa cgtgttgggg gtgggggtcg gggttagggt cggtgtaatt agggggtcgt 120

tgtttttttt tggatatagt ggaagttttt ttcgtattat taaatttttg ttattttttt 180

tgagggattt gtttttaggt agtacgtaag ttgttgtttc gggtttattt cgtatttttt 240

tattgggtga ggaaggagta ttttgaatgg agatgggggt gttttcggtt tatatatttg 300

tagagaagag gtgtgtcggg ttgtattttt ggaggtcgcg gtaattgata ttagagaaga 360

tttcggttgt agttgggaag gtttattggt tggaaagagg tgtttttttt ttttagtaaa 420

gggttttgtt tggaagggtt gttttttatt tgtttagtgg tattatagga cggtcggttt 480

ttattcgaat tttttcggac ggtattatta tatagtcggg ttttcgtagt gttggttttt 540

taattcgatg attgttattt cggtgaggat ttgtgttgat ggtcggagaa ttttgcgttg 600

cgggcgtata tggttaggtg gcgtttggta ggcgacgttc gggtgtagga cggcgttttt 660

atcgttttat tttaaatcgt tgtttgggtt taggtttttc ggttttttga ataggggttt 720

ggggggttaa ggacgttgag gtttcggggg taggaagttt tttttggtta agcgtttttt 780

ttttttttcg gtatatattt ttttatttat ttatttcgtt tattttcggg gcgagaggtt 840

tattaaggta gggcgcgttt tttttatgaa ttattttaag gtttttgagt cgcgggggtt 900



tcgggtaatt attttttttt ttttttggtt ttaggtattt tagtttaggg gtttgtagag 960 

aagttcgaag ttcggataaa cgcgtcggac gttaataatt ttttattttt ggtagtagta 1020 

aaggttaata tatttttatt ttttatttta gtttgttatt aaaataaagt tgcgcgcggt 1080 

tgagggtagg aaggcgttga gatcgagaag aagggacgtt tcggagaaag tgcgtttagt 1140 

tgattttaga aattagagtt tttcgggatt tcgtcgagat tttttgtagg gcgttttaat 1200 

ttgttttttt attgcgtgtc ggcgtcgtag cgcgtgcggt ttagggtttg gtgatttcgg 1260 

tttagttcgg cggtcgcggc gaggtttttg gcgtagtcgt ttggaatttc gtattagaat 1320 

cgggatcgcg taaatgtttt ggttgaagtg ttattttatt taagaaatat tgttgttagg 1380 

aataaaatgg ggttttcggt gtttcgaagt attttttgaa atttttttaa aataatttat 1440 

aaaaaatgtt tttgttttaa cgttttataa cgtttaagga aatatgtaaa tggtttgttt 1500 

ttttatcgag atggtcgttt taattaatag tgtatatata tataataatt tttttaattt 1560 

ttttttttag agttaagtat tttattatat gtaaattata ataaagaaaa gattgtgtaa 1620 

gattatgtaa gtcgattgat ttaaaatatt gagttttaat ttaggttttt tgttttttta 1680 

tttaataatt tttgtgtttg gattagattg gtgaagtagg ttatggaaat taataaagta 1740 

aaaaattaaa agtatttttt ttcgttattt ttttttttaa aattaaataa tagtcgtttt 1800 

tttttgagta ggttttagtt ttaggttcga gtttttttgc gattatttta tagttattta 1860 

tagtagttgt tgttgttttt gtcgggtttt cgtttttgtt ttttttgggt cgttttttgt 1920 

atataaaata tattttagtt ttttaattaa atttaaatac gatttcggta gaatttatat 1980 

atttcgtggt gtatggattg tgtcggtgta ggggaaataa atattttttg gtatttaatt 2040 

attgagttta attcgaaaaa tcgggattgg gtttttaggc ggtattttag gggttttaat 2100 

ttggttcgcg tttttttaga ttttggcgtt gagagcgttg tttttgcggg tgggtggacg 2160 

gagaggtaat aatttgtttt taataaaaat ttgtcgttat cgaatcgaaa gcgaaaggga 2220 

agggagaag 2229 

<210> 37 
<211> 2229 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 37 

tttttttttt ttttttcgtt ttcgattcgg tggcgatagg tttttgttga aagtagattg 60 

ttattttttc gtttatttat tcgtaaaagt agcgttttta gcgttaaggt ttggggaggc 120 

gcgggttagg ttggagtttt tggggtgtcg tttaggggtt tagtttcgat ttttcgaatt 180 

agatttagtg gttaaatatt agagggtatt tatttttttt gtatcgatat aatttatgta 240 

ttacgaaatg tgtaaatttt gtcggggtcg tatttgaatt tagttagaga attggggtgt 300 

gttttgtata taagagacga tttaaagaag gtagaaacga aaattcgata gaagtagtaa 360

tagttgttgt gggtgattgt ggggtgatcg taggaaaatt cgagtttggg attgaagttt 420 

gtttaggaag gggcgattgt tgtttaattt tggagggagg gatggcgaag gaagatgttt 480 

ttaatttttt attttgttaa tttttatagt ttgttttatt agtttggttt aaatataaaa 540 

gttgttaaat agaaaaatag agggtttgga ttaaaattta atattttaag ttaatcgatt 600 

tgtatgattt tgtataattt tttttttatt ataatttata tatagtgaag tgtttagttt 660 

tgaggaggaa agttggaaga attgttatgt atgtgtatat tgttagttag gacgattatt 720 

tcgataaaga aatagattat ttatatgttt ttttaaacgt tgtaaaacgt taaagtaaaa 780 

atattttttg taagttgttt taagaaaatt ttagaagata tttcggagta tcggggattt 840 

tattttgttt ttgatagtag tgttttttga atagggtgat attttagtta gggtatttgc 900 

gcggtttcga ttttaatgcg aagttttaag cggttgcgtt aggaatttcg tcgcgatcgt 960 

cgggttaagt cggagttatt aagttttgag tcgtacgcgt tgcgacgtcg gtacgtagta 1020 

ggaaaataga ttaaaacgtt ttatagaaaa tttcggcgaa gtttcggagg attttggttt 1080 

ttaagattag ttgggcgtat ttttttcggg acgttttttt ttttcggttt tagcgttttt 1140 

ttgtttttag tcgcgcgtag ttttgttttg gtggtaaatt gaaataagaa atggaaatat 1200 

attggttttt gttgttgtta gggatgagag gttgttgacg ttcggcgcgt ttgttcgggt 1260 

ttcgggtttt tttgtagatt tttggattgg ggtgtttgag gttaggagag gagggggata 1320 

gttgttcgga gttttcgcgg tttagaggtt ttgggatgat ttatgggggg ggcgcgtttt 1380 

gttttggtga gtttttcgtt tcgagggtag gcgaggtggg tgggtagggg agtgtatgtc 1440 

ggagagaaga gagaacgttt aattagagag aattttttgt tttcggagtt ttagcgtttt 1500 

tagtttttta aatttttgtt taggaagtcg aaggatttag gtttaggtaa cggtttgggg 1560 

tggggcggta agagcgtcgt tttgtattcg gacgtcgttt gttaggcgtt atttggttat 1620 

gtgcgttcgt agcgtagggt ttttcggtta ttagtatagg tttttatcga ggtgatagtt 1680 

atcggattgg gaaattaata ttgcgaggat tcggttatgt gatgatatcg ttcgggggaa 1740 

ttcgagtgga agtcgatcgt tttgtggtgt tattagatag gtgagaagta gtttttttaa 1800 

atagggtttt ttgttggaag gaggaggtat ttttttttag ttagtgagtt tttttagttg 1860 

taatcggggt tttttttaat attagttatc gcggttttta gaggtgtagt tcggtatatt 1920 

ttttttttgt agatgtataa atcggggata tttttatttt tatttaagat gttttttttt 1980 

tatttagtag aggggtgcgg agtaaattcg ggataataat ttgcgtgttg tttggaagta 2040 

ggttttttag aaaggatgat aaaaatttgg tgatgcggaa gaagttttta ttgtgtttag 2100 

gaaagggtag cggtttttta gttgtatcgg ttttggtttc ggtttttatt tttagtacgt 2160 

tttgttattt taataaagag cggttttttt atttgatttt aattcgtatt agttagcgtc 2220 



gaggaaaga 2229



<210> 38 
<211> 2820 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 38 

tttaggggat aatatttagg tatgttaacg gagtttaaaa tgttaaggaa attatattat 60 

aattttgttt agtatattat aggttgttaa atcgaaatgt tatgttagtt aggagtgtag 120 

taatttttat tttttggttt tatttaatta ggaagtttta gtagagcgaa gtttgttaag 180 

cgttcgtcgt tagaatttga aggaattcga gcgagtaaga agagtgtttg atttatttta 240 

tagaagtttg tttagaaatg gaggagttag cgtttattga agtcggtttc gttttcggtt 300 

cgtttatatg gagtttgatt agttttagtt atgtttattt cggtttggga gattcgtaaa 360

gtgttttttt ttttaatttt tttgtattat tttgaagttt agggaagtaa agagaggggt 420 

atatttggat tgtaaaatta atgttttttg tcgtttagga gagaagggaa tgagagagag 480 

agagagatag atagatagag agagagagag agagagagag agagagagag agagagagag 540 

agaaatttta ttgaaattta gtttttttag aatttgtgtg atttggtttt taacgggaga 600 

ttagtgcgat tttatggtat ttttgttagg aattagcgat ttttttgtag ttattatttg 660 

atttattgtt ttttcgttta ttttttttta taaagttatt ttttttttat tttagtaaga 720 

tttttttttt taatgatgat aaagtttttg ttttagtgtt ttttttagga ttggtgtttt 780 

tttaaaatag tgaatttaga aaattatttc gtttaatatt ttttaaaatt ttcgtagttt 840 

taatgtaagc gtaagtatgt aaaggttttt tgttatattt gtattttttg tttattttag 900 

aattattttt tattttcggg tctgtaatag ttttttttgt ttttttggat agaggtgggt 960 

ggtattaggg gtttagggta gtaggaggtg aggggttgag gaggtcgtta gggtaggttg 1020 

gtttgtgttg gatacgcgtg tttttttgcg gagttaaagg gtcggggacg ggggttttgg 1080 

atttattaga gtaattttag tcggtgggcg tttggtagtt atttaaggag gtagggaaag 1140 

tagcgagttt tatcgggcgg gttacgatga gtagtatgac gggtagtagt agtagttagt 1200 

aaaagttttc gtaaagtgtt tagttgttgt attgtcgcgg ggatttttat agtattatga 1260 

ttagttcgtg taattttgta gtagtaaacg gttttcgagg aatataggat cgcgggggtc 1320 

gggtagcggg ttattgagta tttcgcggac ggcggtagta gaggcggcgg cggtggtagt 1380 

ggtattcggc ggggaagtag tagttaaatt cgcgtatgat ttcgagagtt ttagtaatat 1440 

ttagggattg ggtttagttt cggagcgaga gggtcgttcg ttgagaagtt gcgtcggaga 1500 

cgcgggaagt tgttgttata aggagggagt tttgggaagt cggaggatag gaggagacgg 1560 

gagtttaggg gtagacgagt ggagttcgag gaggtagggt ggagggagag ttaaggcgtt 1620 

tcgtagttcg gtagtcgttt ttcgagtttt gtcgttcgta tttttttggc gtttgggaag 1680 

tagtaggttt ttagttcgtt cggggttacg tgggaagagg tagtcgggtt ttgattggtg 1740 

gagtaggatg taggtttcgg gagggagggg tcgacgagga ggtgtaagga tgtaaggagg 1800 

aggcggtcgc ggaagttata gatgggttcg ttcgttaggc gttggttcga gtggggttag 1860 

gcggggatgg tttaaatgag aagttcgggt tttagggtgg gttattcgta tatttatata 1920 

ttattcgttt tatttttcgt tttaggacgt tttttatcga aggcggggtt cggattagcg 1980 

tttttttttc gcgcgtgatt tcgggtcgcg agtgcgggtc gcggttgggt ggcgtttttt 2040 

cgagttggag atggtggggg cggaggtgtt agaggagtag tagtagtagg gtagagaggg 2100 

gcgagtcggc gcgggagagg gcgttttgtt ggcgatcggc gttttagcgt gcgggagcgc 2160 

gtcgtttagg ttgtaggggg atgtaggttg ggaatgtcgc ggcggagagg ttagggacgt 2220 

ttttttaggg atttatagga aagagggtga gaggcgatgg tgttagaatc gtttttgtcg 2280 

atttggaagt aatagtagta ttttttataa gagcgtgtaa ttttaaggtt gttcgtcgag 2340

gtagtttagt tatttcggta ggcgtttttt tttttttttt tttttttttt tttttttttt 2400 

aggtttttcg tagtttcgat ttagtttaag cgttcgtagg tttgaatttt tttttttatt 2460 

attcgttttt ttttagttcg tagtttatta gtgtgtttat ttgggaggtg cggttagatg 2520 

tgtttggaag gttagattgg tcgggataag tggtttgaga gaaagagaaa ggtttttttg 2580 

tatacgtcgc gggtgggttg tcgggagtat cggtcgggta gcggcgttcg ggaaggggag 2640 

agcgggtttt atttgttggt ttaggtagtg attttgcgtt ttttattcgg gtttttgtcg 2700 

gatggtcggt gatttggggc gacgagagaa ggtttaattc ggtaggagtt tttggttttg 2760 

cgcgtttttt ttattttttt tagcgggaag ggtaaacggt atagcgggat tcgtttttcg 2820 

<210> 39 
<211> 2820 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 39 

cggaaggcgg gtttcgttat gtcgtttgtt tttttcgttg gagagaatga aagaaacgcg 60 



tagagttaga gatttttgtc gagttagatt ttttttcgtc gttttaggtt atcggttatt 120 

cggtaaagat tcgagtaagg aacgtagggt tattgtttgg gttaataaat ggagttcgtt 180 

tttttttttt cggacgtcgt tgttcggtcg atgttttcgg taatttattc gcggcgtatg 240 

tagaggagtt tttttttttt ttttagatta tttgtttcga ttaatttgat tttttaaata 300 

tatttgatcg tattttttag gtggatatat taataggtta cgggttggag aggagcgggt 360 

gatgaggaga gggatttaaa tttgcgaacg tttgggttgg gtcggagttg cggggggttt 420 

gggaggagag aggggagaag agagaaggaa ggagagcgtt tgtcgggatg gttgagttgt 480 

ttcggcgagt agttttgggg ttgtacgttt ttgtgggaga tgttgttgtt gtttttaggt 540 

cggtaagagc ggttttaata ttatcgtttt ttattttttt ttttgtaaat ttttagagaa 600 

acgtttttgg ttttttcgtc gcgatatttt tagtttgtat ttttttatag tttaggcggc 660 

gcgttttcgt acgttggagc gtcggtcgtt agtaggacgt tttttttcgc gtcgattcgt 720 

ttttttttgt tttgttgttg ttgttttttt gatattttcg tttttattat ttttagttcg 780

gagagacgtt atttagtcgc ggttcgtatt cgcggttcgg ggttacgcgc ggaagagggg 840 

cgttagttcg gatttcgttt tcggtagggg gcgttttgga gcggagagtg aggcgaatgg 900 

tatatgagtg tgcgggtagt ttattttgaa gttcgagttt tttatttgag ttattttcgt 960 

ttagttttat tcgggttagc gtttggcgag cgagtttatt tgtggttttc gcggtcgttt 1020 

tttttttgta tttttgtatt ttttcgtcga tttttttttt tcgggatttg tattttgttt 1080 

tattaattag agttcgattg ttttttttta cgtgatttcg ggcgggttga ggatttgttg 1140 

ttttttaaac gttagaggga tgcgggcggt agagttcgag aggcggttgt cgggttgcgg 1200 

ggcgttttga tttttttttt attttgtttt ttcgggtttt attcgtttgt ttttggattt 1260 

tcgttttttt ttgtttttcg gttttttaga gttttttttt tatggtagta gtttttcgcg 1320 

ttttcggcgt agttttttag cggacgattt tttcgtttcg gggttgagtt tagtttttgg 1380

atgttgttga aattttcgag attatgcgcg ggtttggttg ttgttttttc gtcgggtgtt 1440 

attgttatcg tcgtcgtttt tgttgtcgtc gttcgcggga tgtttagtag ttcgttgttc 1500 

ggttttcgcg attttgtgtt tttcggaagt cgtttgttgt tgtagagttg tacgaattag 1560 

ttatggtgtt gtgggagttt tcgcggtagt gtagtagttg gatattttgc gagggttttt 1620 

gttggttgtt gttgttgttc gttatgttat ttatcgtagt tcgttcggtg aagttcgttg 1680 

ttttttttat ttttttaagt gattgttaaa cgtttatcgg ttggaattgt tttggtaagt 1740 

ttagaatttt cgttttcgat tttttaattt cgtagaagaa tacgcgtatt tagtatagat 1800 

tagtttattt tagcggtttt tttagttttt tattttttat tgttttagat ttttaatatt 1860 

atttattttt atttagagaa ataaggggaa ttgttgtagg ttcgggggtg aggggtggtt 1920 

ttgggatggg tagaaagtgt aggtgtagta ggaaattttt gtatgtttgc gtttatattg 1980 

gagttgcgag gattttgaga aatattaaac gggatggttt tttgggttta ttgttttgaa 2040 

agagtattaa ttttagggga aatattgaaa tagaagtttt gttattatta aagaaaaaag 2100 

ttttattagg atgaggaaga aataatttta tgagaaagaa tgagcgagaa agtaataaat 2160 

taaatggtga ttgtagggga atcgttgatt tttggtaaag gtgttatgag gtcgtattgg 2220 

tttttcgttg aagattaggt tatatagatt ttagaggagt tgggttttaa tagaattttt 2280 

tttttttttt tttttttttt tttttttttt tttttttttt ttttatttat ttattttttt 2340 

ttttttttta tttttttttt ttttaggcgg taaaagatat tggttttgta gtttagatat 2400 

gttttttttt ttgttttttt aagttttaag gtagtatagg ggagttgaga aaaagaatat 2460 

tttgcgggtt ttttaggtcg gagtgggtat gattgaggtt ggttaggttt tatgtaggcg 2520 

agtcgagggc ggaatcgatt ttagtgggcg ttgatttttt tatttttgga taggtttttg 2580 

tggagtgggt taggtatttt ttttgttcgt tcgggttttt ttagattttg acggcgaacg 2640 

tttggtaggt ttcgttttgt tgaagttttt taattaaata gggttagagg atgggagttg 2700 

ttgtattttt agttggtata gtatttcggt ttgatagttt gtagtatatt gggtagaatt 2760

gtggtgtaat ttttttggta ttttaaattt cgttaatatg tttgggtatt gttttttagg 2820 

<210> 40 
<211> 2280 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 40 



attagtatag aattgttatt ataggatcgt agtttaagta ttagagtgat attaattggt 60 

tatgtttaat tgttttaaaa ttatttttta aataattata ttgatattat aatagaaatt 120 

atgggtattt attttatatt tagatggaag gtattattgg atatgtatta aaaaaaagat 180 

tttttgaaaa aaataaaata aaataaaata ttattattaa aataaaagaa tttaaaataa 240 

tttttaaaaa ttttttttaa taaaatatat tgttaattta taaaatggat tgatgattag 300 

ttatgtaaat gttttaaata aaatttttat atttttttta tagttaattt atgttttgaa 360 

ttgtttatcg attataaata atggcgaaat ggtatttttt gattatattg tattttgttt 420 

atagaaagtt tgatacgatg gaatttatta ggtaagaggg tgggtgttgt gaacgagatg 480 

tcgtttttag tcgcgggggc gggtagagtt tttggagcgc gtggatttta tgcgagttat 540 

gtagtatttt ttgttttttg ttagaagtta aagttattta tttataatat atttatgttt 600 

tcgtgtaatt gtttattttt gcgtagttaa tagggagtta ttacggggtt aatttatagg 660 

ggaaaaatag atatttattt ttttatatag atgtggatat atgtatatat gtatagaatc 720 

gcggtattta attttatagg tttcggggtc gagggcgagg gtattgttag tttttttagt 780 

agggtcgtgt tgggtttttt gttaaaaggg gtttttagta aagagcgagt tggttgcgtt 840 



tttttagttt tttatagttt gttttttgtt ttaaggaggg agttaagtaa ggggttaggt 900

tttttcggtt gtgcgagttt atagttattt atttatttat gtttatttag gttgatggcg 960

cggggatttg ggtatacgtt tttttagttt tcggggtgtt tgcggtgggg aaggatgtat 1020

cgtttttttt ttgttttttt ttattggggt tggggtttta gtttgagagc gagtgagagt 1080

tatagttata tattttgtgg gttttatttg cgttgtaagg tttattgtgt tagtaggaag 1140

agttatagat tgtttgtagg gaattaattt cggacgcgga ggagttggta tttcgttttt 1200

tggatcgttt tcggtttttt aggataaata cgagtatgtt tattacggtg aaggcggagg 1260

tgataaatat tagtagtagt ttcgggatta atatcgagat ggatattttg ttggtgttta 1320

ggtaggagtt ggagtgcgtt tcggttttcg ttaatttagt gttgttttta ttgtgcgaag 1380

ttaacgtggg cgagatttta gcgtatagtt gagggtagat ttcgttattg gagaggagta 1440

tgaaattttt tttaaagaag tttatcggcg ttttatattt gaggtcgttt attagtattt 1500

cggaatttaa gcgttttgtt tattgtttga aaggtataat tgtgtaggag tatttttagg 1560

ggttttcgtg gaggtttatt tggatgatgg aggttaattg gtttagtatt tttgttatcg 1620

ggaggtatat gaagtaattg ttgtgtaggt tgagtttaga gagcgagatt ttagcgaata 1680

cgtttatagg tagggatttt agtaggttgt tgttgagaat gaggattttt agtttgggta 1740

tggtattgaa agtgttcggg aggatgagtt ggatagcgtt gtattttacg tttaggtatt 1800

ttaggttttg tagtttcgcg aatttttttc gggatagcgt gtttaggtaa ttgttattta 1860

tgtatagtta tttgaggttt aaaaggtttt tgaaagtgtt gttttttata gtagcgatgt 1920

tattgttgtt tagatttaat agaatgaggt ttttgtaatt tataaagtgc gattttcgga 1980

tgttgtggat tttgttattt cgtaggaaaa gtttttgtac gttagagagt ttgggtttta 2040

aattagttaa gttgtttacg tttttgttgt tgtagtttat ttttaaattc gattttggga 2100

tgtggtcgta gttgtagttt ttagggtagg gtaaattgtt agttaagggt ttgtttttgg 2160

agttattcgt cgttatcgtt gttgtgggtt tgattttgat ttgttagttg tttgggattt 2220

ttgtattttc gtttggagta gattttggtg tggtatgatt tttttgttta tttgttttga 2280

<210> 41
<211> 2280
<212> DNA

<213> Artificial Sequence
<220>

<223> chemically treated genomic DNA (Homo sapiens)
<400> 41



ttaagataaa tgggtaagag gattatgtta tattagggtt tgttttaaac ggaggtataa 60 

agattttagg taattggtag attaaaatta gatttatagt agcgatagcg acgggtagtt 120 

ttaggaataa atttttagtt aatagtttat tttgttttgg gggttgtagt tgcgattata 180 

ttttagggtc gggtttaaag atgaattgta ataataggaa cgtgagtagt ttggttgatt 240 

tgaagtttaa gttttttaac gtgtaggagt tttttttacg agataataag atttatagta 300 

ttcgaaaatc gtattttgtg gattataaga attttatttt gttggatttg ggtaataata 360 

atatcgttat tgtagagaat aatattttta agaatttttt ggattttagg tggttatata 420

tggatagtaa ttatttggat acgttgtttc gggagaaatt cgcggggttg taaaatttag 480 

agtatttgaa cgtggagtat aacgttattt agtttatttt ttcgggtatt tttaatgtta 540 

tgtttaaatt gaggattttt atttttaata ataatttgtt gaggtttttg tttgtggacg 600 

tgttcgttgg ggtttcgttt tttaaattta gtttgtataa taattatttt atgtattttt 660 

cggtggtagg ggtgttggat tagttaattt ttattattta gatagatttt tacggaaatt 720 

tttgggagtg tttttgtata attgtgtttt ttaagtagtg ggtagaacgt ttgggtttcg 780 

aagtgttgat gagcgatttt aagtgtgaga cgtcggtgaa tttttttaga aaggatttta 840 

tgtttttttt taatgacgag atttgttttt agttgtacgt taggatttcg tttacgttaa 900 

tttcgtatag taaaaatagt attgggttgg cggagatcgg gacgtatttt aatttttatt 960 

tagatattag tagggtgttt atttcggtgt tggtttcggg attgttgttg gtgtttgtta 1020 

ttttcgtttt tatcgtggtg ggtatgttcg tgtttatttt gaggaatcga aagcggttta 1080 

agagacgaga tgttaatttt ttcgcgttcg agattaattt tttatagata gtttgtgatt 1140 

ttttttattg gtataatggg ttttataacg tagatggggt ttatagagtg tatgattgtg 1200 

gtttttattc gtttttagat taagatttta attttaatag gggagggtag agggaaggcg 1260 

atatattttt ttttatcgta ggtatttcgg gggttggagg ggcgtgtatt taaattttcg 1320 

cgttattagt ttggatgggt ataagtagat aaataattgt gagttcgtat aatcgaaagg 1380

gtttgatttt ttatttagtt ttttttttga aataaagagt agattgtgga gagttgggag 1440 

agcgtagtta gttcgttttt tgttgagagt tttttttgat agaaagttta gtacgatttt 1500 

gttggaagaa ttgatagtgt tttcgttttc ggtttcgggg tttgtggggt tggatgtcgc 1560 

ggttttatat atatatatat atatttatat ttatatagag agatagatat ttattttttt 1620 

tttgtggatt agtttcgtga tggttttttg ttggttacgt agggatgggt agttgtacga 1680 

aggtatgaat gtattgtaaa taagtaattt tgatttttga taaaaaataa aaagtgttgt 1740 

atggttcgta tggaatttac gcgttttagg gattttgttc gttttcgcga ttggagacgg 1800 

tatttcgttt atagtattta tttttttatt tgataagttt tatcgtatta aattttttat 1860 

aaataaaata tagtataatt agaaagtgtt atttcgttat tatttgtgat cggtaggtag 1920 

tttagagtat aagttaattg tgaaaaaaat gtaaaggttt tatttaggat atttgtatgg 1980 

ttagttatta gtttatttta tgagttaata atgtattttg ttgagggaag tttttagggg 2040 

ttgttttggg tttttttatt ttgatggtga tgttttattt tattttattt tttttagggg 2100 

gttttttttt taatatatat ttaataatgt tttttatttg aatgtaaaat aagtatttat 2160 



gatttttatt atagtattag tgtaattatt taaaaaatga ttttgaggta gttaagtatg 2220 

attaattaat gttattttag tgtttaggtt gcgattttat ggtagtaatt ttgtgttggt 2280

<210> 42 
<211> 2470 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 42 

ttttgttttt atgtttagtt tattgaagtt tgtttttagg ttattggagt tgagtttttt 60 

gaatattttt ggtttagagt gttgtagatt tatgaattgt tgtttgttga aataaggtgt 120 

aaagtttatt ttgtttaaag tttttaatag gagtgataat ttaggtaaaa tgtttatata 180 

atagtgggaa attaaaagtt tgcgatatag aaaagtgata tatatttgta aattttttta 240 

ataaaggtat cgtagtataa gtatgattgt agagttagat tgttaatgtt taaatttttt 300

ttgggtttta atttttttat ttgaaaatcg ttaatagtgg aatttagttg atagggttgt 360

tgtggagtaa gcgtttttaa tttggagttt atgaaatttt agtatttgtg aattttgtat 420

tgaaaaaatc gtaattttat gtttattaat tttataatag tttcgttttt ttgaattata 480 

aaggtaggta ataaattata gtagtattat tagtatttat aattttgttt tttatagaaa 540 

ttatttttat atttagtaag ttgttgtaga tattttgaat aattatacgt atatattttt 600 

taagtattat atatataata tttttatatt ttgtatatat ttttggagat agggttttgt 660

tttgtttttt aggttggagt gttgtagtat gatcggaatt tattgtaatt ttaaattttt 720 

gggttttagt gattttttta gttttagttt tggaatagtt aggattataa gtatttttta 780

ttatatttag ttgttaattt tttttttttt cgtaaagata aggttttatt attttgttta 840 

ggttgttttt taatttttgg atataagttt tttgtatagg ttttttaaag tattgggttt 900 

ataggtgtga gttatcgtgt ttggttaata ttatgtattt aattttatgc gtttagaaat 960 

atgtgttatt ttaagaaggc gtttataggt tttattatat tattaaaggt gtttatggta 1020 

taagtgaaga aatttttaga tattagagta ataaaaatta tgatgtatta gaacggatta 1080 

ggtttatatt aaacgtttaa tatatagttt gttgttatgg tttttttttt tttttttttt 1140 

ttttttttga gatagagttt tattttgttt tttaggttgg agtgtagtgg agtgatttcg 1200

gtttattgta atttttgttt ttcgggttta tgttattttt ttgttttagt tttttaagta 1260 

gaggaattgt aaggcgtttt attatgttcg gttatttttt gtatttttag tagaaatggg 1320 

gtttttttat gttacggtag gttggttttg aatttttgat tttaggtgag ttatttgttt 1380 

tagtttttta aagtgttggg attataggta tgagttatta tgtttagttt gttatggttt 1440 

ttaaaatatt tatagaatag aaattgaagt aagtttattc gtttttatat taaaagtagt 1500 

tagtacggtg ttttgtattt atgtatttta tggatgttga atgagttatt gaatgaagtg 1560 

gagtttaggt tgtgttttag gttttttagt ggggttggaa ggaatagttg tgaagttgtg 1620 

gggcggtttt tgtgtatcgg gggattttgg agttagatgg ttttttaggg ttttgaagga 1680 

gaagagttgt gtttttaata atgattttaa tgggaataga gattgttggg gatcgtaggg 1740 

gttaggttgt tattttttaa agtttttttt ttattattta agaaggtatt taggtttttt 1800 

tgtgttaggt tttaggtaaa gtgttggatt atttagtaat tgggtttagt agtaggatgg 1860 

ttttagattg aggttttagg gttaaaggat tatttttttt tttagcgttg gttcgggaaa 1920 

ggtaagtttc gggcgggagc gtacgtcgcg ttttcgaagt ttggtttttt cgttacgttt 1980 

attttttgtt tttatttcgc gtttttttag gttttttttc ggtgaatcgg atgttttgtt 2040 

agtttttttt tttgcgtttt cggtcgcggt tcgggttttt cgtaaagtcg ttgttatttc 2100 

ggagggttta gttagcgggt tttcggaggt tggtcgggta ggcgtggtgc gcggtaggag 2160 

ttgggcgcgt acggttatcg cgcgtggagg agatattgtt ttgtcgcgat gggggttcgg 2220

ggcgtttttt tacgtcgtag gtaagcgggg cggcggttgc ggtatttgtt tatcgggagt 2280 

tttttttttt tttttttgtt gttgttgttt tgtatttagt tcgggggagg atagaagaaa 2340 

aaggaggtag aatggatttt tttggttttt ttttgtgggc gggggcgggt tagggtgggt 2400 

cgcgttgtta ggtagttttg tcgtgttgtt aggtagtttg gtcgtcggcg tgggcgatgt 2460 

cggcgttggg 2470

<210> 43 
<211> 2470 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 43 

tttagcgtcg gtatcgttta cgtcggcgat taggttgttt agtaatacgg tagggttgtt 60 

tggtaacgcg gtttattttg gttcgttttc gtttataggg gaaggttaag gggatttatt 120

ttattttttt ttttttttgt tttttttcga gttggatgta gagtagtagt agtaggagaa 180 

ggaagggaaa gttttcggtg ggtaggtatc gtagtcgtcg tttcgtttgt ttacggcgtg 240 

aaggagcgtt tcgggttttt atcgcggtag ggtagtgttt tttttacgcg cggtagtcgt 300 



gcgcgtttag tttttatcgc gtattacgtt tgttcggtta gttttcggga gttcgttggt 360

tgggtttttc gggatggtag cggttttgcg agggattcgg gtcgcggtcg aggacgtaga 420

ggaggagatt gatagagtat tcggtttatc gggagaagat ttggaaaggc gcgggatggg 480

ggtaggaagt gggcgtggcg agggagttag gtttcggggg cgcggcgtgc gttttcgttc 540

ggagtttgtt tttttcggat tagcgttgag gagaggagtg gttttttggt tttgggattt 600

taatttgagg ttattttgtt attgaattta attattgggt agtttagtat tttatttggg 660

gtttagtata gaaagatttg aatgtttttt tggatgatga aggggaagtt ttggggaatg 720

ataatttggt ttttgcggtt tttagtaatt tttattttta ttgagattat tattaggaat 780

atagtttttt ttttttagag ttttgggaaa ttatttaatt ttagagtttt tcgatgtata 840

gaagtcgttt tataatttta taattgtttt ttttagtttt attggaaggt ttgaggtata 900

atttagattt tattttattt aatgatttat ttagtattta tggagtgtat aagtatagag 960

tatcgtatta gttattttta atataaaaac gaatagattt attttaattt ttattttatg 1020

aatattttaa gaattatagt agattggata tggtggttta tgtttgtaat tttagtattt 1080

tgggaggttg aggtaggtgg tttatttgag gttaggagtt taagattagt ttgtcgtaat 1140

atggggaaat tttattttta ttaaaaatat aaaaagtagt cgggtatggt ggagcgtttt 1200

gtagtttttt tatttgggag gttgaggtag gagaatggta taaattcggg aggtagaggt 1260

tgtagtgagt cgagattatt ttattgtatt ttagtttggg agatagaatg agattttgtt 1320

ttaaaaaaaa aaaaaaaaaa aaaaaaagaa ttatagtagt agattgtatg ttgaacgttt 1380

gatatgggtt tgattcgttt taatgtatta tggtttttat tattttagta tttagagatt 1440

tttttatttg tgttatgaat atttttggta atgtgataaa atttgtagac gtttttttaa 1500

aatagtatat atttttaaac gtatgaaatt aaatatatag tattagttag gtacggtggt 1560

ttatatttgt aaatttaatg ttttgggaga tttatgtagg aggtttgtgt ttaagagttg 1620

gagagtagtt tgggtaagat agtgagattt tgtttttacg aaaaaaaaaa aaattaatag 1680

ttgggtatgg tggggagtat ttgtagtttt agttatttta agattgaggt tggaaggatt 1740

attagagttt aggagtttga ggttatagtg agtttcgatt atgttatagt attttagttt 1800

gggagataga gtaaaatttt gtttttaaaa atatatatag aatataaaaa tattatatat 1860

ataatattta aggaatatat acgtataatt gtttaagata tttgtaataa tttattgagt 1920

atgaaaataa tttttgtagg aaataaagtt gtaggtattg ataatattat tgtggtttgt 1980

tgtttgtttt tataatttaa agaaacgaaa ttgttgtgag gttaatagat ataaagttac 2040

gattttttta atataaagtt tatagatgtt gaaattttat ggattttaga ttaagaacgt 2100

ttgttttata ataattttat taattaggtt ttattgttaa cgatttttag ataaagaaat 2160

taaggtttag aaaaaatttg aatattgata gtttggtttt atagttatgt ttatgttacg 2220

atgtttttgt taaaggagtt tataaatata tattattttt ttgtatcgta agtttttgat 2280

tttttattat tatgtgaata ttttgtttga attattattt ttgttaaaaa ttttagataa 2340

aataaatttt atattttatt ttagtaaata ataatttata aatttgtagt attttgaatt 2400

agaagtgttt agaaagttta gttttagtgg tttgagagta agttttaata gattaaatat 2460

aaaaataaag 2470



<210> 44 
<211> 5907 
<212> DNA 

<213> Artificial Sequence
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 44 



gtcgtcggag tagggtttta gcgtatcgcg agtttttgtt tttttcgttt tgtagcgtcg 60

cgagtttttt tttttttcgc gtttggtttc gtttgcgttt ttttcggcgt gggtattcgt 120

agattcgtta ggaggttgac gagaggtgtt tttatagagt tttgtttttt tttagatttt 180

taaacgaaaa gaaagagaaa agttaatttt tcgtttttat tttgtacgta tttggagagc 240

ggttggtggc gagcgattcg tataatcgtt tttttcgcgg cgtttttgga cgggcggaga 300

gaattcgggt gattgagttt gaggttgagt ttttttggga gcgatttatg ttagttattt 360

ttatttttaa gggggattcg cggtcgggtt tacgcgcgaa ggatgggtgt tagggaaatt 420

gatttataga cgattcgttt cgcgagagat tgataattta gttttgcggg ttttcggggc 480

gtttagatta gtttttagtt agaggatttg gggttgtata gttttcggat tttggagggg 540

gtggttcggg ggtgggagat tgcgagtatt agtttacgtc ggtttttttt tattttgcgg 600

cggcgaggtg ggaagggtta ggatgcgagt ttagtatttt ttttagaaat gtagtatttc 660

gtttttttat tttttttgtt ttcggcgttt ttttttttac gcgttttttt tcgttttttt 720

tttaagggcg tttttttttt gtttttagtt tacgtttgaa tttttcggcg tttttttttt 780

ttttttttag tttttttttt acgttttttg ttttcgggta tttttttttt ttaatttttt 840

attttcgtat cgtttgattt cgaggggcgg gagcgtattg ggttgcgtac gggtgggggc 900

gtcgcgttag tttcgcgtag ttgttttgac gtcgttgtcg tcgtcgtcgt cgtcgtcgtt 960

tttcgtagtt tagttcgcgt ttcgcggtag tttcgtagtg tattagttat tatcgtcgtc 1020

gtcgtcgttt cgttagattt gttgttagtt tgttcggttt agttttgaga gagtttcgaa 1080

cgttagttgc gagggttatg agttagagag tttcggggcg tcgcgcggag agtaagcgga 1140

gatagcgatt ttgcgttttt tagttttcgt ttttttgtat cgcgtttttc gtattttcgg 1200

gtttttttgt ttttttcgtt gtttttatcg tcgttatggt tattttgttt cgtagtaagt 1260

tgtttaacgt ggttacgttc gtgtttaata agttttaggt taagatgagc ggtatgttcg 1320

ttaggatggg tttttaggcg gttacggatg aggaggcggt gggtttcgcg tattgcgacg 1380



atttcgattt tgagtatcgt tagggtttgt agatggatat tttgaaagtc gagggagagt 1440 

tttgcgggga cgagggcgtt gaagcgttcg tcgagggaga tatttattat tagcgaggta 1500 

gcggagtttt tttgtcgttt ttcggtttta aggattaggt gggaggtggt ggcgaattcg 1560 

ggggttacga taagtttaaa attacggcgt gggaggtagg ttggaacgtg attaacgtta 1620 

tttaggtaag cgcgggattt ttagttttgt ttgttttttt tttttttagt ttagcgtgtc 1680 

gggttttgtt ttcgatagtc gttcggtgat ttcggtttgg agattttttt ttgtatttag 1740 

gaattttttt ttttttattt tttttagttt tgcgcgggga tttacgtttt taggcggtgt 1800 

ttttcgtttt aatttacgtt tttttttaac ggtattagtt gtaagatcgt taggttgaag 1860 

ttcggtttga gatatttgtt cggagatatt gtaaaagtga aggaaatggg gggagggagt 1920 

aggaagcgat gagaaagaaa gaaaattagg attggagggt acggtttggt tttggatttt 1980 

ggaacggatt tatagttgta tttttgggag gaaagaagaa ggggaaatcg ttgaggtcgg 2040 

agtttttttt tttcgtatat acgtatagat acgtatacgt atataggtag gtatattata 2100 

tgtttatagt tttttttttt acgaataaag gcgtaggtat aggcgtatac gatatgtatt 2160 

tatatataga ttatagtatt atttttaaat gtatttgcgt atatttttat agttttataa 2220 

ataaaagtat atatttgtat atattaaata tacgaatatt cgtttgaata tatgtatata 2280 

tatatgtata gatgtatgta gataggaatt agtttgtttt tttgggtgcg gttttgttgg 2340 

agatatatag gtatatatat acgaaagtat atatatgttt ttatataaat atatttagtt 2400 

cgtaataatt tagtattatt tgtgtataga ttcgtttttt tcgttcgaaa tttcggttgt 2460 

atcgtaaaga ggcggttagt tttttttcgt gtgattttgg cgaagttttt tttagaggtt 2520

aagaatggtt tttgtttttt aggttttttt tttttatttt taatttttta ttttttatta 2580 

ggcggcgtag gatttttata tagggagtag gtttggagtt ttggtaacga ttttcgagtt 2640

agtaggggtt ggtaggtcgt agttttttgg gtcgaaggag attttagttt taggtaattt 2700 

cggtttttta tagtttggtt tcgcggcgtt tttggagcgt attattattg gggttacgga 2760 

aggtaggttt tttgggagta gaggtttttt aggggttgtt ttatgtatcg gggtaagtag 2820

gattttatcg ggtttcgaat attagatggt tcgatttaag gcgtttttcg tttcggggtt 2880

cggcgagacg gttttgttgg attacggatt cggggttgtt ttcgttttcg ttgtgtttcg 2940 

cgatttttcg gggcgttttt taaggtttcg tttgataggt tttaagggag gttagggtag 3000 

gaggttggag gttcgtaggt tttcgggatt ttggatttcg ttagttttta gttttcgtgt 3060 

ggattttaaa ttttatagtt ttgttcgttt gattttgaag tttttggttt tagatttttg 3120 

ttcggttcgg ttttttggtt tttttggcgt tttttcgtta agtttttttt tttgtttgtt 3180 

tttttagttt ttatttagtt ttttttttta ggtttcgttt tcgttttatt ttaattttag 3240 

ttttgtttta agatttttaa tttttagttt tgtttttttt ttttcgcgcg ttgtttttta 3300 

agcgttttta tattgttttt gtttttgttt tttaggggtt ttgttttatt tttagttttt 3360 

tttttttttt attttatttt tttttcgttt tataattgta gtttttaacg tttttttttt 3420 

tggtttgagt ttgagacgtt cgagtttttt tttttagttt tcgtgtttta ttttagtttt 3480 

attataaatt tcgtgtttat tttttttatt ggtttaggtt tagtttatta ttttttttag 3540 

agtttttttt cgattacgtt tttttttgtt tttttttatt ttttttttta attttttatt 3600

ttgtttttgt ttttagaaat tatttcgaag tatttttttt tttttttgtg gttttttttt 3660

atttttagtt tttttttatt ttagtttttt cgttattatt agttattagc gtattttttt 3720 

tttttagatt cgtttatata tttttttttt tttatttttg gttcgatatt cgggtttatt 3780 

attagttttt ttatatttat ttcgtagttt gtttttaatt tttttttttc ggcggtttag 3840 

atttaatttt tagtgttttt agcgtttttt tcgggttata gcgttaagtt acgtttttcg 3900 

ggttttttat tcgttgttaa gttcgttgag cgttcgcgtt tggttgtttt ttcgttttat 3960 

agggtatgtt cgtgttgggt ttattttacg ttattttgta cggcggttat ttggggttgt 4020 

tttttattat tttcgtcgtc gttgtgtgtt gttatatcgg taagattttt atcgcgtgtt 4080 

tgtacgagga gaatgaagac ggcgaggtgg tgcgcgtgcg ggattcgtac gtggttatag 4140 

ttaacgtttg ttgcgtttcg cgttttttaa cgttgggcgg tcgagtggtg aacgtagcgt 4200 

agattatcga gttggtgatg acgtgtattt tgtacgtggt ggtgagtggt aattttatgt 4260

ataatagttt ttcggggttg ttcgtgtcgt agaagttttg gtttattatc gttacggtcg 4320

tgttgttgtt ttgcgttttt tttaagaatt ttaaggtcgt gtttaagttt agtttgttgt 4380 

gtattttggt ttatttcgtt attaatattt tggttatagt ttattgttta tcgcgggcgc 4440 

gcgattgggt ttgggagaag gttaagtttt atatcgacgt taagaagttt tttattttta 4500 

ttggtattat cgtgtttagt tatacgtttt agattttttt gttttcgttg gagggtaata 4560 

tgtagtagtt tagcgagttt tattgtatga tgaattggac gtatatcgta gtttgcgtgt 4620 

ttaagggttt tttcgcgttc gtcgtttatt ttatttgggt cgacgagatt aaggaggtta 4680 

ttacggataa tttgttcggt tttattcgcg tcgtggttaa tatttttttg gtggttaagg 4740 

cgttgttgtt ttattttttg ttattttttg tcgttgtcga ggtgttggag aagtcgtttt 4800 

tttaggaagg tagtcgcgtt tttttttcgg tttgttatag cggcgacggg cgtttgaagt 4860

tttgggggtt gacgttgcgt tgcgcgttcg tcgtttttac gttgtttatg gttatttatg 4920

tgtcgtattt cgcgttgttt atgggtttta tcggtagttt tacgggcgtc ggtttttgtt 4980 

ttttgttgtt tagttttttt tatttgcgtt tgttttggcg taagttgttg tggtattaag 5040 

tttttttcga cgtcgttatt ttcgttatcg gcggtatttg tagcgtgttc ggtttcgtgt 5100 

attttttcga gggttttatc gaagtttatc gaattaacgc ggaggattag ggcgtaaggg 5160 

cgagttttcg tcgcgttttt gcgttttttt tttttttttt ttatttcgtt tttattagtt 5220 

tagtgcgttt tgtcgtcgcg ttgggaggtt aagttttaaa tatttttggt ttttagtttt 5280 

tgattattcg gggatggggg ggatgggagg ggatagggat ttacgattta tcgcgtttgc 5340 

gtttttgttg tttttttttt tttataatat tttggttttg gggggaggcg gggtgtattt 5400 

gcgggtaggg ttttttgttt ttttaagtgg ggtttcgata ttttggtttt agttatcgag 5460 

ggggttggga agggagggag agggggcgta gttcgtaggc gtggtaattt gattttgggg 5520 

gaatatttta tatttattta gagttcggaa tttatagcgt ttagttattt ttagtaagag 5580 

cgttttttat ttcggagacg ttttaatttt gtagcgggaa aggttgattg ggaaatttat 5640 



tttgggtggg taattttttt taacgaagtc ggaaggcgag aagtcgcggc ggggttagtt 5700

tgtttgtcgg tttttaggaa tttaaatttt tattttgtgt aatttattag gtgtggaatt 5760

gttttattgt gcgtgtggtg tgttcgtggt gaataagatg aaatgtatat tagaaaaaaa 5820

tttattttta atttagagtg cggtatataa ttatattcgt aaataaagaa gagataaagg 5880

tttgcgcggt tcggtgtcgg gtttgtg 5907

<210> 45
<211> 5907
<212> DNA

<213> Artificial Sequence
<220>

<223> chemically treated genomic DNA (Homo sapiens)
<400> 45

tataaattcg atatcgggtc gcgtaagttt ttgttttttt tttatttgcg gatataatta 60

tgtatcgtat tttaaattag agatagattt ttttttgata tatattttat tttatttatt 120

acgagtatat tatacgtata gtagaatagt tttatatttg ataaattgta taagatgaga 180

gtttagattt ttgaaaatcg gtaggtaagt tggtttcgtc gcggtttttc gtttttcggt 240

ttcgttgaag gaaattgttt atttaaaatg gattttttag ttagtttttt tcgttgtagg 300

gttgaaacgt tttcggaatg ggaagcgttt ttgttggaaa tggttggacg ttgtagattt 360

cgagttttgg atggatgtga aatatttttt taaggttaag ttgttacgtt tgcgagttgc 420

gttttttttt tttttttttt taattttttc gatgattgga attaaagtgt cggggtttta 480

tttggaagga tagagaattt tgttcgtaaa tgtatttcgt ttttttttaa aattagggtg 540

ttgtggaaaa gaaaggataa tagaaacgta gacgcgatgg atcgtgaatt tttgtttttt 600

tttatttttt ttattttcga ataattagaa attaggaatt agagatgttt aaagtttggt 660

tttttaacgc ggcggtaggg cgtattgggt tggtgggggc ggggtgaggg gagaagggag 720

agagcgtaga agcgcggcgg gggttcgttt ttgcgtttta gtttttcgcg ttggttcggt 780

aggtttcgat gaggttttcg agggagtgta cgaagtcgga tacgttgtag atgtcgtcga 840

tgacgaagat ggcgacgtcg aagaagattt ggtgttatag tagtttgcgt tagagtaggc 900

gtaggtgaaa gaggttgggt agtaagaaat agaggtcggc gttcgtgagg ttgtcggtga 960

ggtttatgag tagcgcgaag tgcggtatat aaatggttat gagtagcgtg aagacgacga 1020

gcgcgtagcg tagcgttagt ttttaggatt ttaggcgttc gtcgtcgttg tagtaggtcg 1080

ggaaaaaggc gcggttgttt ttttggaaga gcgatttttt tagtatttcg atagcggtaa 1140

agaatggtag aggataggat aatagcgttt tggttattag aaagatgttg attacggcgc 1200

ggatggagtc gggtaggtta ttcgtgatga tttttttggt ttcgtcggtt taggtgaggt 1260

aggcgacgag cgcgaagagg tttttgagta cgtaggttgc gatgtgcgtt tagtttatta 1320

tgtagtggaa ttcgttgggt tgttgtatat tgttttttag cgaaggtagg aagatttgag 1380

acgtgtagtt gaatacgatg atgttaatgg agatggggaa ttttttgacg tcgatgtaga 1440

atttgatttt tttttaggtt tagtcgcgcg ttcgcgatag atagtaggtt atgattagga 1500

tattgatgac gaagtgggtt agagtgtata gtagattgaa tttggatacg gttttgaggt 1560

ttttaaggaa ggcgtaaggt agtagtacgg tcgtggcgat aatggattag gatttttgcg 1620

atacgggtag tttcgggaag ttgttgtata tgaggttgtt atttattatt acgtatagga 1680

tgtacgttat tattagttcg atgatttgcg ttacgtttat tattcggtcg tttagcgttg 1740

ggaagcgcgg ggcgtagtag gcgttggtta tggttacgta cgagtttcgt acgcgtatta 1800

tttcgtcgtt tttatttttt tcgtataggt acgcgatgag gattttgtcg gtgtagtagt 1860

atataacggc ggcgaagatg atgagaaata attttaggta gtcgtcgtgt aggatggcgt 1920

agggtaggtt tagtacgaat atgttttgtg gggcggagag gtaattagac gcggacgttt 1980

agcgaatttg gtaacggatg aggggttcgg ggggcgtggt ttaacgttgt ggttcgaagg 2040

gggcgttaag gatattgaga attgggtttg agtcgtcggg gaggagaggt taagagtagg 2100

ttgcggggtg gatgtagggg aattaatagt gagttcgggt gtcgagttag agatggagag 2160

ggagggatgt atggacggat ttgagaagga aaggtgcgtt aatggttggt gatagcgaga 2220

gggttagggt gggagagggt tgagaatagg gagaagttat aggagggaag ggagatattt 2280

cgggatggtt tttggggata ggaatagaat gaggagttgg gggaggagat gagaagggat 2340

agaagggagc gtagtcggaa aggggtttta agaaaggtag tgagttgggt ttgggttagt 2400

ggaaagagtg ggtacgggat ttgtggtggg gttggaatgg ggtacgggag ttaaagagga 2460

aaattcgggc gttttaggtt tagattagag gaggaaacgt taaaggttgt agttgtgagg 2520

cgaaaaggaa atagggtaga gagggagagg ggttgaaggt ggggtaaagt ttttgggagg 2580

taagaataga gataatgtga agacgtttgg ggagtagcgc gcggggagga ggaagtaggg 2640

ttggaaattg agagttttag gataaggttg gagttagagt gaggcgaggg cgagatttaa 2700

ggaaggggat taagtggagg ttggggaggt aagtaggagg aggagtttgg cgggaaggcg 2760

ttagggaggt tagaaagtcg aatcgagtaa gagtttagaa ttagaagttt tagggttagg 2820

cggatagggt tgtagggttt agaatttata cggggattaa gagttaacga aatttagggt 2880

ttcggaggtt tacgagtttt tagttttttg ttttggtttt ttttggggtt tattaaacgg 2940

agttttgaaa gacgtttcgg gaagtcgcgg ggtatagcga aggcgggagt agtttcgggt 3000

tcgtgattta gtaaggtcgt ttcgtcgggt ttcggagcgg aaagcgtttt gggtcgggtt 3060

atttggtgtt cggggttcgg tggagttttg tttatttcga tatatggaat aatttttggg 3120

aggtttttgt ttttagaaaa tttgtttttc gtggttttag tgatagtgcg ttttagggac 3180

gtcgcggggt taagttgtaa gggatcgggg ttgtttagag ttgaagtttt tttcggttta 3240

gaaagttgcg gtttgttagt ttttgttagt tcggaggtcg ttgttaagat tttaaatttg 3300



ttttttatgt aggagttttg cgtcgtttgg tggggagtgg ggggttagga ataaggggag 3360

agggtttaga gagtaggggt tatttttagt ttttggagag ggtttcgtta gggttatacg 3420

gggaggagtt ggtcgttttt ttgcggtgta gtcgggattt cgggcgagag gggcgagttt 3480

gtgtataggt aatgttggat tattgcggat taaatgtgtt tgtgtgggag tatgtgtgtg 3540

ttttcgtgtg tgtatatttg tatgttttta atagggtcgt atttaaggga gtaggttggt 3600

ttttgtttgt atgtatttgt gtatgtgtgt gtgtatatgt ttaaacgaat attcgtgtgt 3660

ttggtatgtg taggtatatg tttttgtttg tggagttgta ggggtgtacg taggtgtatt 3720

tggggatagt attgtggttt atatatagat atatgtcgtg tgcgtttgtg tttgcgtttt 3780

tgttcgtggg aaggggggtt gtgggtatgt ggtgtgtttg tttatatacg tgtgcgtgtt 3840

tatgcgtgtg tgcgggggga ggagatttcg attttagcga tttttttttt tttttttttt 3900

ttaaaaatgt agttgtgaat tcgttttaga gtttaagatt aaatcgtgtt ttttaatttt 3960

gatttttttt tttttttatc gttttttgtt tttttttttt atttttttta tttttgtagt 4020

gttttcggat aggtgtttta gatcgaattt tagtttagcg gttttgtagt tggtgtcgtt 4080

gggagggagc gtgggttagg gcggaaaata tcgtttgggg gcgtaggttt tcgcgtaggg 4140

ttgggaggga tggagaaagg gagatt tfctg ggtataggag ggggttttta ggtcgagatt 4200

atcgggcgat tgtcgggggt agagttcggt acgttgagtt gggagggggg aggataggta 4260

gaattgggaa tttcgcgttt atttggatgg cgttggttac gttttagttt gttttttacg 4320

tcgtgatttt gggtttgtcg tggttttcga attcgttatt attttttatt tggtttttgg 4380

agtcggaggg cggtagagga gtttcgttgt ttcgttgata atggatgttt ttttcgacgg 4440

gcgttttagc gttttcgttt tcgtagggtt ttttttcggt ttttaggatg tttatttgta 4500

ggttttggcg gtgtttaaag tcgaggtcgt cgtaatgcgc gaagtttatc gtttttttat 4560

tcgtggtcgt ttgaaaattt attttggcga atatgtcgtt tattttggtt tgggatttgt 4620

tggatacgga cgtggttacg ttggatagtt tgttgcggag taaggtggtt atggcggcgg 4680

tggggatagc ggaaaggata gaaggattcg aggatgcggg gaacgcgatg taagaaggcg 4740

agggttgggg ggcgtaaagt cgttattttc gtttgttttt cgcgcggcgt ttcggggttt 4800

tttggtttat gattttcgta gttggcgttc gaggtttttt tagggttgga tcgggtaagt 4860

tggtagtagg tttggcggag cggcggcggc ggcggtggtg gttagtgtat tgcggagttg 4920

tcgcggggcg cgagttgggt tgcggagggc ggcggcggcg gcggcggcgg tagcggcgtt 4980

agagtagtta cgcgaagttg gcgcggcgtt tttattcgtg cgtagtttaa tgcgttttcg 5040

tttttcggaa ttaggcggtg cgggggtggg ggattggaga gaggaaatat tcggaggtag 5100

gggacgtgag gaaggggtta ggagaagaga aagggggcgt cgagggattt agacgtgagt 5160

tgggggtaga agaagggcgt ttttgggaag gaggcgggag ggagcgcgtg aaagagggag 5220

cgtcgagggt aggggaggtg ggggggcgaa gtgttgtatt tttaggggag gtgttaagtt 5280

cgtattttgg ttttttttat ttcgtcgtcg tagggtggga gggggtcggc gtgggttgat 5340

gttcgtagtt ttttattttc ggattatttt ttttagggtt cgggagttgt gtagttttag 5400

attttttggt taaggattgg tttgggcgtt tcgaaagttc gtaggattga gttgttagtt 5460

tttcgcggag cgggtcgttt ataagttagt ttttttggta tttatttttc gcgcgtaaat 5520

tcggtcgcga gttttttttg aaggtagaag tggttggtat ggatcgtttt taaggaagtt 5580

tagttttagg tttagttatt cgggtttttt tcgttcgttt agggacgtcg cgagggaagc 5640

gattgtgcgg atcgttcgtt attagtcgtt ttttaaatgc gtgtagaatg gaggcgaaag 5700

gttggttttt tttttttttt tcgtttgggg gtttggaaga aggtaggatt ttatgaggat 5760

atttttcgtt agttttttag cgggtttgcg aatgtttacg tcgagagggg cgtagacggg 5820

gttaggcgcg gaggaaagaa gggttcgcgg cgttgtaagg cggaggaagt agaggttcgc 5880

ggtgcgttgg ggttttgttt cgacggt 5907



<210> 46 

<211> 3049 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 46 



tatttggtgt agttggttta gattttgttt ttgttttggt tggtgttgtt ggtaaaggtg 60

taggtgtttg tggagttgta tgttattatg tggaagtttg atttggatag tttgttgaag 120

gtttgtttta ggaagttgaa tttgaggtaa tagtgtgagg tgtagtgttt tgggggatgg 180

ttggggtttt ggtttttgtt tagggtgttt ttaaatattt ttttggttag tgatgttttt 240

ttgtaaatgg tgatgtgtgt tatttgttgg aatttggtgt ttgtttgtgt gttttgtttg 300

atggtgtagg agttgtggta gttgatggtg atgtagtttg agtgttggtt gttgtttagt 360

gattgggatg gtgtgagtag tgggtttgtt gtgtttttgg atggattgtg gttagttagt 420

tttagtgtgg gtgatggtgt tttggtagag gtgttttttt gttgtttggg ttttgagtag 480

ttaagtggta gtagtttgtt atttagtgag ttttttttgt gtattttgtg ttgtgagggt 540

ggtggttttg ggttgggttg ttggggtgtt ttgaggtggt gtatgagttt tggtagtttg 600

aagtatttgg ttttgtgttg tagttggttg tgtttgggga agtagttggg tagtatgagt 660

tgtaagtttt gtaggtaatt taggatgtag tggaagagga agttgttttg gtttagaaag 720

aagtggtttt tgttgttttg ggttagtttt tgtggttgtt gttgtgtgaa tatgtgttag 780

agtagtgagt tgggtattga tattattgtg tagtgttggg ttatgtatat ttggtttttt 840

atgtttagtt ttatgatgtt ggggaagagt ggtggttttg tggaggatga tgaggagtta 900

ttgttgtttt tgttgttgtt tttgttgggt aatttatgtg tgttgtttgt tagagttatg 960



atagagaggt ggttgggttg ggatagtggt aggaagttgt gttgtattta ggagttgtaa 1020 

ttgttttttt tggagttttg gaatttggat gtttgtttag ttttgtgttt tgttgttgtt 1080 

gttgttgtta ttgttgttat tgttattgtt gttatttttt agagttgtgt ggagttgagt 1140 

ggtgtgagtg tgttgttgtg tgtttttttg agttttagtg tgttttgttt gttttgtttg 1200 

tgtgttttag gtgagtgttg ggttgtggtt tgtgtgttgt ttaagtagag ttgtttgttt 1260 

tggtgttgtg gtgtgtagga ggtgtgggtg gtggtggagt tttgtttatt gggtgagggg 1320 

aggtggggag tggggatttg atttgtattt taaatgtgga tgttatgtgg ggttgttggt 1380 

tttattttgg tttgtgagtg ttgggggtgt ttggattttt atttgttttt tatttattag 1440 

ttttgttttg tggttttgtt taattttttt ttttttttga agaggatttg tttgagattt 1500 

ggtggttttg gaagatagtt ttttggaggg atagttattg tgtttttttt ttttttgttt 1560 

tagtttgtat ttaaggattt tggaaggagg atgtgaattt ttggaagttt gttgttttgg 1620 

gagtggaaat gtgtggttgt ttttggtttt tttgtttgtg gaattgggtg tttttagtag 1680 

tgtagttgtg gtttgttatt gttggttgtt gtgttattga aagtagagag atgtgaaaaa 1740 

tttggagaga tgaaagagtt gttttagttt gagtgagtta gattttgttt aagttgtttt 1800 

gtagttagtt tggaaaataa ataaataaat ttagaaataa aaagtaaatt ttagtgtttt 1860 

gtagtaggat gttttatgat agtttatttt ttggttgtag agataaaatg agatgaagtt 1920 

ttaaatttgt ttttaattgt aatttttttt ttttgtaggg tttttgaggt attttatttt 1980 

gaagtgaagg ggtaagattt tggaatgttt ggtttttggg ttttatttag tttttgttga 2040 

attgtgtttt tagatttata atagttgtag atttaatgta atttttaaag ttatttttgg 2100 

tgtttgtatg tttatttttt taaaagttta ggtgtttttt tttttttgta tgttttggtt 2160 

attttttggt tttgtttaag tttttgttag ttttagaaag ggatatttat atatttgaag 2220 

gtttttgttt ggattttatt ttttaatttt tgttgtttat atgtttgttt tttttttttg 2280 

gattgtagtt tatagaattg tttttttttt ttggaagttt tttttggttt tttaggtggg 2340 

tggggttttt ggatggtttt ttatatattg tttgtgttgt ttttagtatg ttttgaaatg 2400 

ttatatttat gagtgattat ttgattattg atttgggtat gaaaataagt aagagttttt 2460 

gaatgttgta aaaagagaag agtgtagagg agggtgttaa gaaatgattg ggttattgaa 2520 

tgttttttgt gtatttgtta ttatttgtag ggtattaggg agtgaggaat gtaaaggtga 2580 

ataattttta aagtgttttt agtgtagttt aataagtaaa gggttagggg tttgtgtagt 2640 

tattatatta gtttgaagta tagattatat attaggataa attaggtaag atttttgttt 2700 

ttgataaaat ttattttttt gtgtaattat ttagtgagtt ttatagtttt tttatagata 2760 

aatgttattt ttattttgaa atgtttatat ttgtttaata attattgtgt ttatgttagt 2820 

gtgttgaaat gagattaatt tttaaattag ttattttaat tgtaagtata gattttgata 2880 

ttgaatgtat tttaatattt tttttttgtt gttgttttgt tttttgggat aaataataga 2940 

taatataaaa tttttaagtt ttagtgtagg attttgaggt ggttgtgtta ttaggtgtta 3000 

tattagtgtt agagatttga taaaattagt ttttttttta taaatttaa 3049

<210> 47 
<211> 3049 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 47 

ttggatttgt agaaaagagg ttggttttgt taggtttttg atattaatat aatgtttagt 60 

agtataatta ttttgaaatt ttgtattggg atttggaagt tttgtgttat ttattattta 120 

ttttaaagaa tagaataata ataaaaaaag aatgttgaaa tatatttagt attaaaattt 180 

gtatttgtaa ttaagataat tggtttgggg gttgatttta ttttagtata ttgatatgag 240 

tgtggtaatt gttgagtaga tataaatatt ttaaaataaa aatgatattt atttgtgagg 300 

agattgtgaa gtttattggg tagttatata gaaggataaa ttttattaag aataaagatt 360 

ttgtttgatt tattttgatg tgtggtttat attttaaatt gatgtgataa ttgtgtagat 420 

ttttgatttt ttgtttattg ggttatattg aaagtgtttt ggagattatt tgtttttgta 480 

ttttttattt tttggtgttt tgtaaataat agtaggtata taagaaatat ttagtgattt 540 

aattattttt tgatattttt ttttgtattt tttttttttt gtagtattta ggagttttta 600 

tttattttta tatttaaatt agtgattaga taattattta taaatatgat attttagaat 660 

gtgttgggag tagtgtagat agtgtgtagg gagttattta gaggttttat ttatttggga 720 

agttagggaa agtttttagg gggagggggt ggttttgtgg gttgtagttt gagggggaaa 780

agtaaatgtg taggtagtag gagttgagag atgggattta gataagggtt tttagatata 840 

taaatatttt tttttgagat tgatgggaat ttggataggg ttgaagagtg attgaggtat 900 

gtgggagaga aggggtgttt ggatttttgg aaggataaat atgtaggtgt taaaggtggt 960 

tttaaaaatt atgttagatt tgtaattgtt gtgagtttaa gggtgtgatt taatgaaggt 1020 

tgaatgggat ttggagattg aatgttttgg agttttattt ttttattttg aagtgaagtg 1080 

ttttgaaggt tttgtgggaa gagggaattg taattaagag tagatttgaa gttttatttt 1140 

attttgtttt tgtagttaga aaataaatta ttgtaaagta ttttattgtg ggatattgag 1200 

gtttgttttt tgtttttagg tttatttatt tattttttag gttggttgta aggtaatttg 1260 

aataaaattt ggtttgtttg gattggggtg gtttttttgt ttttttaggt tttttgtgtt 1320 

tttttatttt tagtgatatg ataattaatg atggtaggtt gtagttgtat tgttggaagt 1380 

gtttagtttt gtaggtggga gggttggagg tggttatgta tttttatttt tggggtgata 1440 

ggtttttgaa agtttatgtt ttttttttag agtttttggg tgtgggttgg ggtgagaggg 1500 



gagggggtgt ggtgattgtt tttttaagga gttgtttttt gaagttgtta agttttgggt 1560 

gggttttttt tggaagaggg agaaagttgg gtgggattat ggggtggggt tggtgggcgg 1620 

ggagtgggtg ggagtttggg tgtttttagt gtttgtgagt tggggtggag ttagtagttt 1680 

tgtgtggtgt ttgtgtttaa gatgtaaatt gagtttttgt tttttatttt tttttgtttg 1740 

gtaagtgagg ttttgttgtt gtttatgttt tttgtgtgtt atggtattag agtaggtggt 1800 

tttatttaag tagtgtgtgg gttgtgattt ggtatttgtt tggagtgtgt gggtaaggtg 1860 

ggtggagtgt attggaattt aagggggtgt atagtggtgt gtttgtattg tttggttttg 1920 

tgtggtttta ggaggtggtg gtggtggtgg tggtggtggt ggtggtggtg gtggtggtgg 1980 

ggtgtagggt tgagtgagtg tttgggtttt ggggttttgg ggaaggtggt tgtagttttt 2040 

gagtgtagtg tggttttttg ttattgtttt ggtttggtta tttttttgtt atggttttgg 2100 

tggatagtat atgtggatta tttaatgggg gtggtggtgg gggtggtagt ggttttttgt 2160 

tgttttttgt ggagttattg tttttttttg atattgtgga gttgaatgtg gggggttagg 2220 

tgtatgtgat ttggtgttgt atggtggtgt tggtgtttga tttgttgttt tggtgtatgt 2280 

ttatgtagta gtagttgtag gagttggttt gggatagtaa aggttgtttt tttttggatt 2340 

gggatggttt ttttttttgt tatattttgg attatttgtg ggatttgtag tttgtgttgt 2400 

ttgattattt ttttgagtgt agttggttgt agtgtgaggt tgagtatttt gagttgttag 2460 

agtttgtgtg ttgttttggg gtgttttagt agtttggttt ggggttgttg tttttgtggt 2520 

gtggggtgta taaggagggt ttgttgggtg atgagttgtt gttgtttggt tatttggagt 2580 

ttgaatagta ggagggtgtt tttgttgggg tgttgttgtt tatgttggag ttggttagtt 2640 

gtagtttgtt tgggggtgtg gtgggtttgt tgtttatgtt gttttagttg ttggatggta 2700 

gttggtgttt gggttatatt attattggtt attgtggttt ttatattatt gggtgggatg 2760 

tgtaggtgga tgttaagttt tggtgagtgg tgtgtattat tgtttgtgga aagatgttgt 2820 

tggttaagga ggtgtttggg gatattttga atgaaagttg ggattttgat tgttttttgg 2880 

agtgttatat tttgtgttat tattttaagt ttaatttttt ggagtaggtt tttgataagt 2940 

tgtttgagtt gggtttttat atggtggtgt gtagttttat gggtatttgt gtttttgtta 3000 

gtagtattga ttagagtgag gataagattt ggattagtta tattgagta 3049 

<210> 48 
<211> 2407 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 48 

taaggtttgg gtatttttag gtagtaggga taaggtgggt tttttttttg gttgttaaat 60 

ttatgttaaa gttgagttta gggattggag tttaagaaat ttattgttta ttttttagtt 120 

tgattgggga tttgtatgta tttttgttgt gttgttttga gttttttaat tttttatatt 180 

ttttttttgt tatgtatatg tttttattta ggtttgtaaa agttttagtt ttttttaggg 240 

gtagggattt gtatgttggt ttagggtttg gtatgtgggg atgttgaaat agggttaggt 300 

ttggttttta gttgattgtt agagttttaa ggtttagtaa ttttttttat aaaggttttg 360 

ttaagaggtg aggaaataag agttgggaga ggggtgtgga atggtgggtg ggatgaatga 420 

ttagttttgt gtttttggtt agttgtgttg agttaggggt attgtggttg ttgtgtggtt 480 

ggaaatttag gaatgggaag gtttggggtt tgtttggttt tggaggtagt tggtgggttg 540 

tttttggtgg tgttggagtg gttagtggta gttgggtatg ggtgattggg ttgtttgggt 600 

tgtttttaga ttgtgatttt tgaaaaattt tgtgggtggg gtgtgtttgt gttgtttttt 660 

gttggaaggt gtgagttagt gtgtttgatt gtgggtgggg gtggaaagag gtgtgtttta 720 

aagtggtaat agatggtttt tttatttaat aggattaaaa aatttgtttt tatttggttg 780

attgtggaag tagagtaggt gggtggttaa tggggatatg atggggggtg gagttgaggt 840 

ttttgaagtg gaagtgggtt gttgttgagg tggtggtatt ttttttgagg agtttttttg 900 

ggtggttgaa gaaggagttt tttttttgga gtgtgttggt ggtggtgttt gtggatttaa 960 

ttagttttag gttaggttga gttttgtggg aaagtagtgg taagttaggg ttttgtagat 1020 

gtgaggttta ggtagttttg tggtttatag aggttttggt ttgtgttttt tgggggagtt 1080 

gtgttgtgtg gtttgattta gttgaggttt tgtagtttgg gattttgagt tagttttggt 1140 

tgtttgtatt gttgtttgtg tgggtgtatt gagtttggtt tggtgtgggt aatagaagtt 1200 

aggaggtttg tgtttgggtt ttggtttatt ttggggggtt gtggttatgg ggtttagttt 1260 

ttagtttagg aagggaaatt gagattttgg gaggggtagg aatgttttta aggttatttg 1320 

gaaagttggg taggatgtgt tgttaggggg aagatttggg tagggttttt gttttttgtt 1380 

gatgatgttt tttttgtgtg tttgtgttgt tgttttgtta ttgtggggtt tgtgagtttg 1440 

ttggggtgtg tgggttgtgt ggtggggttt tttgtaggtt gggaggattt gagtatgggt 1500 

gtgggtttga ttgtgggggt gttgaggttg tagtttaaaa tttagtaggg ttttgatttt 1560 

tgggtgtgtt tttgggtttt ggttggtggt tggtggaatg tgtgattgtg aggtttgtgg 1620 

tttagttttg tattgtttgg gttttttatt gttttggtgt gtttatagat aggtgtatga 1680 

agatttttat gatttgaaat agagttgtta gtaaatattg tttttttgtt tttgatttat 1740 

tggggaagag ggaaaaggat agagtttggg taagttgttt tggtagggat tttagttttt 1800 

gttttttatt tgttagtttt tatagatgtt tataaattta ataatttttg ttttgttttt 1860 

gtattaagtt tttgagttag atgtagggtt ttagttttgg agtttggttt agattgttta 1920 

attgattggg gagattgagg tttagaaaag tgaagtggtt tgtttaaggt tatatagtta 1980 

gttatttggt agtagatgag gttaagtttt atttgtaaga tttgggtttt gaatttattg 2040 



attaggagtt ttgggattat tgttaataaa agagatattg aagggaattt tttgttattt 2100 

ttttggtgat ttgtttttta atggataagg atatattggg tttagtttta tttgtgagtt 2160 

tgaggtgaaa tagaggtatt tgagtagtaa gatatattgt tggtttttgt attgtttgaa 2220 

tttgagtttt taaaaatttt attttaatat attgtttatt gatttttttt tgaattatta 2280 

tttttgtaag gattttttgt aaatattgtt tttttaattt ttatgaaatt ttaatgttat 2340 

atgtaaatta ttttttttta tataatgtat gtatatttgt gttttgtata taaaatgagt 2400 

aagattt 2407 

<210> 49 
<211> 2407 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 49 



aaattttatt tattttatgt ataaagtata gatgtgtata tattatataa aaagaaatag 60 

tttatgtatg gtattaagat tttatgaaga ttagaaaaat aatgtttata aaaggttttt 120 

ataaaggtag taatttaaga aaagattaat aaatgatgtg ttaaagtaag atttttggaa 180 

gtttaaattt aggtaatata aaagttagta atatattttg ttatttgaat gtttttattt 240 

tattttaaat ttatagataa aattgaattt aatatgtttt tgtttattaa aaagtaaatt 300 

attaagaaag taataaaaga ttttttttaa tgtttttttt attgatagtg gttttaaaat 360 

ttttggttaa tgaatttaaa atttaaattt tgtaggtagg atttaatttt atttgttgtt 420 

aaatagttgg ttatgtgatt ttgggtagat tattttattt ttttggattt tagttttttt 480 

agttagttgg atagtttaag ttaggtttta gagttgagat tttatgtttg gtttaagaat 540 

ttggtgtaga aatagaatga agattattaa gtttgtgaat gtttatggga attgataagt 600 

gaaagataaa agttgaaatt tttattaaaa tggtttgttt aagttttatt tttttttttt 660 

ttttttgatg gattaaaggt ggaaaagtgg tgtttattag taattttgtt ttgggttgtg 720 

agaattttta tatatttgtt tataggtgtg ttagagtggt gaagggtttg agtggtgtag 780 

ggttgggttg taagttttat agttgtatgt tttattaatt attagttggg gtttggaagt 840 

gtgtttggaa attgaggttt tattaagttt tagattgtga ttttggtgtt tttatggtta 900 

ggtttgtatt tgtatttaga tttttttgat ttataaaagg ttttgttatg tggtttatgt 960 

attttggtaa atttgtaggt tttatgatgg tggggtggtg gtgtgaatat ataaaggagg 1020 

tgttgttagt ggggaataaa aattttgttt gggttttttt tttaatagta tattttgttt 1080 

gattttttaa gtgattttgg gggtgttttt gtttttttta gagttttagt tttttttttt 1140 

aggttaggaa ttaagtttta tggttgtggt tttttagggt gagttgagat ttagatgtag 1200 

attttttaat ttttgttgtt tgtgttaagt tgggtttggt gtgtttgtgt ggatggtagt 1260 

gtgagtgatt agagttggtt tgaggttttg ggttgtaaag ttttggttgg gttaagttgt 1320 

gtagtgtggt ttttttaaga ggtgtgggtt gaggtttttg taggttgtga agttgtttaa 1380 

attttgtatt tgtaaggttt tgatttattg ttgttttttt gtaaagtttg gtttaatttg 1440 

gagttagtta ggtttgtagg tgttattgtt ggtgtatttt ggagaagaag tttttttttt 1500 

agttgtttag gagagttttt tgagaaagat gttgttgttt taatagtaat ttatttttgt 1560 

tttggaggtt ttggttttgt tttttattat gtttttattg gttgtttgtt tattttattt 1620 

ttgtggttgg ttgggtaagg ataaattttt taattttatt ggataagaaa attatttgtt 1680 

attattttaa aatgtgtttt tttttgtttt tgtttataat tgagtgtatt aatttgtatt 1740 

ttttggtaag agatggtgtg ggtgtgtttt gtttgtaagg ttttttggga gttatggttt 1800 

gagggtgatt tgggtgattt ggttgtttgt gtttggttgt tattgattgt tttaatgttg 1860 

ttagggatga tttgttagtt gtttttggag ttgagtaggt tttgaatttt tttattttta 1920 

gatttttagt tgtataatag ttgtggtgtt tttggtttga tgtagttggt tggaggtgtg 1980 

gagttggttg tttgttttgt ttgttgtttt gtgttttttt tttggttttt gtttttttgt 2040 

tttttaatga ggtttttgtg aggaaggttg ttgggttttg ggattttgat gattggttgg 2100 

aaattaggtt tggttttgtt ttagtatttt tgtgtgttaa gttttgggtt ggtgtgtggg 2160 

tttttgtttt tggaggaagt tgggattttt gtaggtttgg gtggagatgt gtatataata 2220 

gaggaagagt gtggaggatt ggaggattta gggtagtatg gtagaggtgt atgtaggttt 2280 

ttggttggat tggagaatgg gtggtgggtt ttttgagttt tagtttttga gtttgatttt 2340 

gatgtgggtt tagtaattag gaaaaaagtt tattttgttt ttgttgtttg agaatattta 2400 

gatttta 2407 

<210> 50 
<211> 3952 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 50 



ggttgtagtt ggagaggaat aggaattgta gtggggatgt tttggttttt gggtttatag 60



tatttgggat ggatgtgtag tagtgtgggt tgggaattgg gtattagggt gggatgggtg 120 

agaggtttta agggataagg tagggagaag tgtagtgggg tgtggggaat tgtatgtttt 180 

ttttttgttt ttgtttttta ttttgaggtg gtagggtggg tgggtgtggt tttgggggtg 240 

ggtgggttgg gtggggtgga ggtggggttg tagtattggt tttatttagt tttttttgtt 300

tgtagttaga gtgagttgag ttgtggttag ttttggtggg tagggggggt gttggagtgt 360

agtgtagtgt agttttatta gtttgtaaag tggattgagt tggaagttga gtgttgttgt 420 

gggaggtggg tgatgggggt aggtgttatt ggttgtgtta tggatgggtt gtgtttgttg 480 

ttgttgttgt ttttgggggt gagtgttagt tggagggggt ttgttttttt ttttgggatt 540 

agaattttga gaagagttgg gtgttgttat taaggaaata gaatagagta ttggggtttt 600 

agatattgag ggtgggtggt gggaaaggat ttttgatgtt gggattatga aggagggttt 660 

agggtttttg gagtgtagag gtgatttttt agggtggaga tgagggtaag attggagtat 720 

ggatgttggt ttttaggtat tgtagggggt ggtgggggag ttgggagggg ttttttaaga 780

gggggtatgg ggttttttga tgtttaggtt ttttggaaga ggatatttga atgttgggat 840 

tttgaaggga tttttttttt agtattttgg tttttggaga gttgtgggat agaattaagg 900 

tgagagaagg agggggaatt ggatggggat tagagatgag aggggtgagt tgggttgggg 960 

tggggagttt gggatgatgg gatggaataa tggaggaggt tggagggatt ggaggtagag 1020 

gggatttggt aaggataatg gggattggtt agataatggg ggggaggggt gttggaggga 1080 

gaggttgggg agtgggagag aggttgagaa gtgggttagg gagggtagaa gaggggtgat 1140 

ggtgagggtg gtagagagat ggaagtagaa ggaaaggaga tagagtgggg tgtggatttt 1200

gaagtggttg gggttttgtt tgtttgttga gggatagtga gatgagaaga gagagtagag 1260 

gtttggatat ttttagtgtt tgtttttgtg tttttgtttg tggttttttt tagttttgga 1320 

attggggttg tggatttaag gttgggaggg gagtgtttat tttgttgggg tgaatttgtt 1380 

tttggggttt ttttatggtt ggtgggtttt tttgatgtgt gtgtttttat ggtttgaagt 1440 

agtgtagtgg attgtgaggg atggagtggg tgtgttttta tttgggttgg tggtggttat 1500 

gttttggggt gtagagtgtt ggggtgtatt aggtggggat tttttttggt tgtgtgtagt 1560 

tttggttatg taggtgggag gtggtaggta gggggtgttg tggggtgagg taagtgtagt 1620 

ttgttgttgt ttgtttgtga gtttggggtt gtgtgttgtt ttttatttag ttgtgtgatt 1680 

tgagtgttgt gtttattgtg ttttgtgggt ttttgtgttt tgaagggttg ggatttggtt 1740 

tgagtttttt ttggtttttg gtgtaagttg ttgggtattt aatgagatgg gggtgtttgg 1800 

atgttttgga ttttgtagtg tggtattttg ggttattgat ttggtaaagg ttggtgggtt 1860 

ttttgggtgg tttgatgaga ttgttagatg tggtttgttt tatggttatg aattgaatgt 1920 

tttggaaggt ttggtaatgg aggaggtgtt tttatttttt ttgttagtat tttttattag 1980 

agtattgtgt tttaaatttt ggttttatag atatggagat tgaggttttg agagtttaaa 2040 

tgatttggtt aagattttat agtaagttgg gttagagttg tggttagatt ttttgatttg 2100 

atttttagtt tggtattttt ttattttttt agggggtagt tgttgtttta gtttggggtt 2160 

atggttgtgt tttttagtag ggtggggggg gttgatattg tttttagagt ttgttggtgg 2220 

tagagtttgt gggtgaggtg ggtagagatt ttgtttgggt gtgggtttta gtttgaaagt 2280 

ttagttttag ttttttggag ttgttaggat ttgtggttgt gtttggggat ttgggagtga 2340 

ttgtgggtat gtttgttgat taatgttgtt ggttagagat gttgtttaga tgtgggtggg 2400 

tgagtgtaga ttaggagagt ggggaggagg gattggattt tagaggggta atgatttggg 2460 

ttttgggttt gggtaagagt gagggttttt gtgtgaagat ttaaggaagg aggggagttg 2520

ggtgtggagg atttgggtgt ttgggaaggt gtattttgat aggaatttgt tgtttgtagt 2580 

ttttgtggtt tttgtttttg ttttggtgtg tggttgtttg gtttgggatg gttgtgtgtg 2640

gtttgagttt ttttttgggt tgtggtgaga gagagagggt tgaaattaga gtgtgttttg 2700 

gtgggttggt tggtgggttg tgttggtaat ggaggtattt tgttatttag atgtttgtaa 2760 

ttagagttgt tgggttggtt aatgtgttta atagggatgg aatgagggta gtaaatgggt 2820

gtgtgtgagt ggttgggttg agttgggtgg atggtggatg gggaggtagt tttgtggggg 2880

atgggttttt attgtattgt tttagtttag tttggtttaa gtgttttgtt ttttttttgg 2940 

gggattgtgt ggggaggtgg gaataattga tgtgttgagt ttttattaag tgttttgtat 3000

ttttttggta gttttttttg aattatttaa tttttataag gttttagtaa ggtgaattta 3060

gttattttta ttttgtagtt gaggaaatag aattttttag atttttaggt ttgagattag 3120 

gtttttttag tgagttagtg gtggtgtttt tgttggaatt taggatgttt ggtttgggat 3180 

gttttttttt ttatatatgt taggtttttt tttgtttttt attttttaga ttatggaagg 3240 

agttgtggtt ttgagagatt aaattgtgtt ttatgttttt ggttttatat atttagattt 3300 

gagttttttt tttatatgta gaggttttgt ttttttttaa gtttttatgt tatattattt 3360 

tgggtaatta gttttttttt ttttaatttt ttttttattt aaagggagtg aggggtatgt 3420 

gttagtgttt tttttgggtt tgtattgatt ttaaaaggtt ttttaggtgt ttgaaaattt 3480 

tttttatttt atggtggtga ttatggtttt atttatttaa gagttttttg ttgaagttgt 3540 

tggaatggtt agggttgatt atattgaaat atttgtatgg ttttaaatat ttttaattgt 3600

tttgttgttt tttttttttt tttaattaaa aaaaatgaaa atttttaaaa gttaatttat 3660 

ttaaggattt ttttatttta gttttttgta agttttgaaa tggttaaaat aaaaagatta 3720 

taagtgtttt taaaaatgag atgaaattat tttgttaaat gtgaaatgat taagtgtaat 3780

atatataaat tttagggaaa tttatttgtt atttttataa attaattttt ttggatttga 3840

ttttggagta attagtttga atgttagtta ttttttttta tttggttaaa tttttttttt 3900 

ttgtttgggg tttttttaag atgagatgtt tttttgataa ttaatattta tt 3952 

<210> 51 
<211> 3952 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 51 



aataaatgtt agttattaaa gaagtatttt attttaagga gattttaggt aggaggggag 60 

agtttagtta agtagggagg ggtgattaat atttaggtta attgttttaa aattagattt 120 

aaaaggatta atttataggg atgatagata aatttttttg aaatttgtat gtgttatatt 180 

tagttatttt atatttgata gaatgatttt attttatttt taagaatgtt tgtaattttt 240 

ttgttttggt tattttaagg tttataagga attaaggtgg aaggattttt agatgaattg 300 

atttttaaga atttttattt tttttaatta gagaaaagag agaaatagtg aaatagttga 360 

aagtatttga aattatatga atgttttagt ataattagtt ttggttattt taatagtttt 420 

agtaaagagt ttttaagtga atggggttgt ggttattgtt atgggatggg agagattttt 480 

aaatatttgg agggtttttt gaggttgatg taaatttagg ggaggtattg gtatatattt 540 

tttatttttt ttggataggg gaggagttgg gaggaaagga gttaattatt taagataatg 600 

tagtatagag atttgagagg gggtagggtt tttgtgtgtg gagaaggggt ttagatttaa 660 

atgtgtgggg ttaaaggtat agggtatgat ttaatttttt agagttatga ttttttttat 720 

gatttagaag gtagggggtg ggagggagtt tggtatatgt aggaaagaga atattttagg 780 

ttaagtattt tgggttttaa taaagatgtt attattgatt tattggggga gtttggtttt 840 

gggtttgaaa gtttagaagg ttttgttttt ttaattgtga agtgggaata attaaattta 900 

ttttgttagg attttataag gattaaataa tttaagagaa attattaggg aagtgtaagg 960 

tatttagtag gaatttagta tattgattat ttttgttttt ttatgtggtt ttttagggaa 1020 

gaagtgaggt gtttgagtta gattgggttg gaatagtgtg gtaggggttt gttttttgtg 1080 

gagttgtttt tttatttatt atttatttag tttagtttgg ttgtttatgt atgtttattt 1140 

gttgtttttg ttttattttt attaggtgta ttagttagtt tggtggtttt ggttatagat 1200 

gtttgaatga taaagtgttt ttattattgg tgtggtttgt tagttgattt gttgggatgt 1260 

gttttggttt tagttttttt ttttttattg tggtttagga agaaatttgg attgtgtata 1320 

gttattttag attgagtagt tgtgtgttga ggtggaggtg ggagttgtag gggttgtaga 1380 

tggtaggttt ttgttggggt atattttttt aagtgtttag gttttttatg tttagttttt 1440 

tttttttttt gggtttttgt gtggggattt ttgtttttgt ttagatttgg agtttaagtt 1500 

gttgtttttt tgggatttgg tttttttttt ttgttttttt ggtttatgtt tgtttatttg 1560 

tgtttgagta gtgtttttaa ttagtggtgt tagttagtag atgtgtttgt ggttgttttt 1620 

aaatttttgg atgtagttat aggttttgat agttttaggg aattggggtt gagtttttgg 1680 

gttggggttt gtatttggat agaatttttg tttattttat ttgtaggttt tattgttgat 1740 

ggattttggg gatagtgtta attttttttg ttttgttggg aaatgtagtt gtgattttga 1800 

gttgggatag tggttgtttt ttgaaaaggt gggggagtat tgagttggga attaggttgg 1860 

ggagtttagt tatgattttg gtttaatttg ttgtgagatt ttggttaagt tgtttaaatt 1920 

tttagagttt tagtttttgt atttgtaaag ttggaatttg gggtgtagtg ttttgatgaa 1980 

agatgttggt ggggagagtg aagatgtttt ttttgttgtt agatttttta gggtatttgg 2040 

tttatggtta taaagtaggt tatatttgat aattttgttg gattatttgg aggatttgtt 2100 

gatttttgtt gagttggtgg tttgggatat tgtgttatag aatttgaggt gtttgggtgt 2160 

ttttgttttg ttaggtgttt agtggtttgt attgagagtt aggagaggtt tagattggat 2220 

tttgattttt tgaggtgtgg gagtttatgg agtgtggtgg gtgtggtgtt tgggttgtgt 2280 

agttaggtgg ggagtggtgt gtagttttag atttgtaggt aggtagtggt ggattgtatt 2340 

tgttttgttt tgtagtgttt tttgtttgtt gttttttgtt tgtgtagtta gagttgtgtg 2400 

tggttaggaa gggtttttgt ttagtgtgtt ttggtgtttt gtattttgag atgtagttat 2460 

tgttagtttg ggtaggggta tatttgtttt gttttttgtg atttgttgtg ttgttttaag 2520 

ttgtgagaat atgtgtgttg gaggagtttg ttggttgtgg gggaattttg ggagtgggtt 2580 

tgttttggtg aagtgggtat ttttttttta gttttagatt tgtagtttta attttgggat 2640 

tgggagaggt tgtgagtagg agtgtgggga taggtgttgg aaatgtttaa gtttttgttt 2700 

tttttttttg ttttattgtt ttttagtggg taggtgggat tttgattatt ttagggtttg 2760 

tgttttgttt tgtttttttt ttttttgttt ttattttttt gttgtttttg ttgttgtttt 2820 

ttttttgttt tttttaattt attttttaat tttttttttg ttttttaatt tttttttttg 2880 

atgttttttt tttttattgt ttggttggtt tttattgttt ttgttgggtt ttttttgttt 2940 

ttagtttttt tgattttttt tattgtttta ttttgttgtt ttgggttttt tgttttagtt 3000 

tagtttgttt tttttatttt tagtttttgt ttagtttttt tttttttttt tgttttggtt 3060 

ttgttttatg attttttaga gattgagatg ttgaggggaa agtttttttg ggattttggt 3120 

atttgagtgt ttttttttga agaatttggg tattggagag ttttatgttt ttttttgaaa 3180 

gatttttttt agttttttta ttgttttttg tggtatttga ggattggtat ttgtgttttg 3240 

gttttgtttt tatttttatt ttggagagtt gtttttgtgt tttgggaatt ttagattttt 3300 

ttttgtggtt ttggtattag aggttttttt ttattattta tttttagtat ttgggatttt 3360 

aatgttttgt tttgtttttt tggtggtggt gtttggtttt ttttggagtt ttgattttgg 3420 

gaaagggagt gggttttttt tggttaatat ttatttttag aagtagtaat agtagtaggt 3480 

gtggtttgtt tatggtgtgg ttggtggtat ttgtttttat tgtttgtttt ttgtggtagt 3540 

gtttgatttt tagtttggtt tgttttgtgg attgatgggg ttgtgttgtg ttgtgtttta 3600 

gtgttttttt tgtttgttgg agttggttgt ggtttggttt gttttggttg tgggtgggag 3660 

aggttgggtg aagttagtgt tgtggttttg tttttgtttt gtttagtttg tttatttttg 3720 

gaattgtgtt tgtttgtttt gttgttttgg ggtgggaagt agaggtaaag ggagggtgtg 3780 

tggttttttg tattttgttg tgtttttttt tgttttgttt tttagagttt tttatttatt 3840 

ttgttttggt atttagtttt tggtttgtgt tattgtgtgt ttgttttgga tgttgtgggt 3900 

ttggggatta gggtgttttt attgtggttt ttgttttttt ttggttgtgg tt 3952 



<210> 52 
<211> 2436 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 52 



agttttggat gataaggatt aattgtttag ggtgttggtt gagaggaata ggaagagtgt 60 

tttgtggagt agggtgggtg agaagagttg ggtggttgtg gggtttttgt gtatgttggg 120 

taggaagagg aagagttgta gtatgaaggt tattagggag atgtataaga agtaggtggg 180 

ggttattatg ttgagtagtt gtagggttag tatttttgat tatatttttg tggggtttgg 240 

ggtgagaaga ggattgattg attatggtgt ttagggggtg tttttgtttg tttttttgtg 300 

ttgtgtgttt ggggtattgt atgtgatttt atgtttttaa ttttttgttt tgtggatttt 360 

ttttttttag ttttttaatt ttggattagg taggttagag atttgagttt gtagtggttt 420 

tgggtttagg ttttatttgg gggtgagtgt tgtttggggt tttgagtgag tttggtgttt 480 

tggtgtggta gagtttggta taggagggag ggatttgttt gtagttttta ggagttgtga 540 

gggtgggggg tattggggga taggattttt tttttttgtg gatgtggaaa taagtttagg 600 

tttgggattt ttggtttaga tttttttagg aggagaaagg gaaggatttg gggttggggg 660 

taggagaatt agtggtattt gatttgttgg aaattttgtt ttatattttt ttttttttgt 720 

ttttgtgttt ttatttatat atatattttg ggtagtattt agggatttta ttaggttatt 780 

taaggttttt tggaggtatg tgtgtgtgag ggattatgat gtgtgtgtgt gtgtttttgt 840 

ttgtgatttt gtatgtgtaa tttgtgtgtg tttgagtatt tgtatttgtg tatgtatata 900 

tttgtgtgta tagatatgta gtttttgtta tttagtttta aaaaaattaa ttgaaataat 960 

tgatatattt atatttgttt ttgggtattg ggtgaggtgg gggtgtaatt agaatatttg 1020 

tgttagtggg ttttggttag tatatttttt taatgttggt tgtttgggtt atttttttgt 1080 

gaagagtgtt tttatttgag gtgaaattaa aatttttttt ttgtgtttgt tttttgtttt 1140 

tttgttgaga gaaatttaaa taatttttat ggtgttgaaa ttttttttat gttgataagt 1200

ttgttttggg tgtatgagtg gatgtttggt tagttttttt tgggattgat tttgttgttg 1260 

tgtttagttt ttattatgtt tttattttat tttatgtttt atagttgggg tttttggtta 1320 

gttttggaag ttattgagaa ataggatttt gtgtgtttga gagaattttt ttaggggtta 1380 

aggaattggt tagttggagg tgtgagaaaa gttttgggaa ggtggttgta tttaggatgg 1440 

gtggatgtta tggggttttt gtttaggttg ttttgtttgt atgggttgat tggttatttt 1500 

ggaatttttt ttgtaggttt tagtgttttt tgggaatttg tagtttttgt ggagagtgtg 1560 

ggtttttttt tattgttggg gtgtagtgta gtgtatgttt gagggtggtt gttgggggtt 1620 

gggtatgttt ttagttttgt gttgttgggg gttgtggtgg tgttgtttat tttagagagt 1680 

ttttggtttg gggttttggt gaagtaagtg tttttttggt gttggttgtt aggggggtgt 1740

gggagtagtt agatgtgttg tagtgttggg aaggtggtga aggatagggg ttaggggagt 1800 

gaggggtgtt tggtaggtag ttttagtttt ggttttgtgt gggagaaggg atagtagaga 1860 

ttgtttgtgg ggttttgggg tgtagggagt tgtttgttta gttttggtgt ttgttttgtt 1920 

tgtggtagag ggtggtatag ttggagtttt ggaaagattg gtagtgttgg tagttgtggg 1980 

ttttttggtt attgtttttt ggatgtatgg gaagttgttt tttgtgttgt tgttgttgta 2040 

ttgttgttgt tgtagagggg tgaggaaatt aatttattga gttttggttg ttgataagag 2100 

gagttttgga tgttggtttt tgttttgttt gaggttgtaa agttgtggat ttggtttggt 2160 

tggtttgtag tttgtgtttt gttttgggaa ttgggtaagt agtggggatg tggggaggag 2220 

ggagtgggta gttgttggtt ttttatttgg gtgttagtga gtaagggtta ggaagtggtg 2280 

gggatggtag ttgggtattt tttagttttg ttattttttt ttttgttttt gggggtggtg 2340 

ttttggttgt tagttttagt attgtggatt tgtttttttt tttttttttg attttttttg 2400 

agtgggttgg ggtgttttgt ttggttttag tgtttg 2436 

<210> 53 
<211> 2436 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 53 



tgggtgttga aattaggtag ggtgttttag tttgtttaag gggaattgag aggggaggag 60 

ggggtaagtt tatgatgttg agattggtgg ttagagtatt gtttttagga gtggagggag 120 

gggtggtggg gttggggggt gtttagttgt tatttttgtt gttttttggt ttttatttgt 180 

tggtgtttaa gtgggaagtt agtagttgtt tgtttttttt tttttatatt tttgttattt 240 

gtttagtttt tgaagtgaag tgtaggttgt gagttagttg ggttgagttt ataattttgt 300 

agttttgggt agggtgagag ttggtgtttg gggttttttt tgttggtgat tagagtttgg 360

tgagttgatt tttttatttt tttgtgatgg tggtagtgtg gtggtggtgg tgtggggggt 420 

ggttttttgt gtgtttggga ggtagtggtt gagaagtttg tggttgttgg tgttgttggt 480 



ttttttaagg ttttggttgt gttgtttttt gttgtgggtg aggtgggtgt tagggttgaa 540

tggatagttt tttatatttt ggggttttat gggtggtttt tgttgttttt ttttttgtgt 600

agggttaggg ttgaggttgt ttgttgagtg ttttttattt ttttagtttt tgttttttgt 660

tgttttttta gtgttgtggt gtatttggtt gtttttgtgt ttttttggtg attggtgttg 720

ggaaggtatt tgttttgttg gagttttagg ttgagggttt tttggggtgg gtagtattgt 780

tgtagttttt ggtggtgtag gattgggggt gtgtttagtt tttggtgatt atttttaggt 840

gtgtattgtg ttgtgtttta gtggtaggga gaggtttatg ttttttgtgg aggttgtggg 900

tttttggggg gtgttgggat ttgtaaaggg gattttaagg tgattagttg atttgtgtgg 960

atgggatagt ttggatggag gttttgtggt atttatttat tttaggtgta attgtttttt 1020

tagaattttt tttgtgtttt tggttggttg attttttagt ttttggaaaa gtttttttgg 1080

gtgtatggaa ttttattttt tggtggtttt tgaagttggt taggaatttt ggttgtgagg 1140

tgtggggtgg gatggggatg tggtgagggt tgagtatggt ggtggggttg attttaggaa 1200

aggttggtta agtatttatt tgtgtgttta gggtgggttt gttggtatgg aaaggatttt 1260

ggtgttatga gggttgtttg ggtttttttt agtaggaggg tgggaggtgg gtgtgaaggg 1320

gggattttaa ttttgtttta aatgaaaatg ttttttgtaa agaaatagtt tgggtagtta 1380

gtgttggaga gatgtgttgg ttaggatttg ttgatatagg tattttggtt gtgtttttat 1440

tttatttggt gtttgagagt gagtgtgagt gtattgattg ttttaattga tttttttaaa 1500

gttagatagt agaggttgtg tgtttgtata tgtagatata tatatatata aatatagata 1560

tttaagtgta tatagattat atgtatagag ttatagataa ggatatatat atatatattg 1620

tggtttttta tgtatatata tttttaaaga gttttgggta atttggtgag gtttttgagt 1680

gttgtttaag atgtgtgtgt gggtggggat gtagggatgg agaaagggga gtgtgaagta 1740

ggatttttag taggttaagt gttgttgatt tttttgtttt tgattttaaa tttttttttt 1800

ttttttttta gggaggtttg aattagaaat tttaaatttg ggtttgtttt tatatttata 1860

gaggggagga gttttatttt ttggtgtttt ttatttttat aatttttaaa gattgtaaat 1920

gagttttttt tttttgtgtt agattttgtt gtgttaaggt gttgagtttg tttaaaattt 1980

taggtaatat ttatttttag gtggggttta gatttgaggt tgttgtgggt ttgggttttt 2040

ggtttgtttg atttggggtt agggaattgg ggaggggaaa tttgtgggat agggggttgg 2100

gggtgtggag ttgtgtgtag tgttttagat gtgtggtatg gaggagtggg tgagggtgtt 2160

ttttggatgt tgtagttagt tagttttttt tttattttgg gttttgtagg aatgtaattg 2220

aggatgttgg ttttgtggtt gtttaatgtg gtggtttttg tttatttttt gtgtattttt 2280

ttggtgattt ttgtgttgta gttttttttt tttttgttta gtatgtgtga ggattttgtg 2340

gttgtttggt tttttttgtt tgttttgttt tatggggtgt tttttttatt ttttttggtt 2400

aatgttttgg gtaattagtt tttgttattt agaatt 2436

<210> 54
<211> 2470
<212> DNA

<213> Artificial Sequence



<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 54 



agatttattt aaatttaaga atgagaatat aaatttatat tttgaagtgt tttatagaaa 60

ggtttatttt aatgtttgga gtatatattt taatgaatat ttattttatt ttattttttt 120

ttatttttga attaagtaat tttgaattta aagttgttat gattagtatt gaaaagatta 180

ttggattatt aattgtgtga ttttgggata gtaatttttt gtattttagt ttgtttatat 240

gttatatatg aaggttgaag tttgattttg ttttgtgatt attattttaa atatttgatg 300

aaattaaatt ttagtgtttg gaatggtagt ataataaatt tattaagaat aaataattta 360

ttgtaaaaat atattgattt ttaaatgatg taattgatag ttatattatt gtagagggtt 420

gataaataat aaaagaaatg aaagatgtat atggtgagaa ttgaaattat tttgataagt 480

tttttatttg tttattattt aaaattaatg attatgttga atgtttataa attataaaat 540

ataaaagaaa ttttataaat gtgtatgtat aggagtttaa gttattaaaa gttttaaagt 600

ataagtttaa attaaattaa ttaaagaagt tgagaggaaa aattggtttt tatttttaat 660

tattattgtt ttgaggtttt atgtttaata taatttttta agtagaggtt ttagagagaa 720

gagttgtgag gatattttta tatttgtgta gaaggaaaag tttgttattt attttagtat 780

ttttagtgtt atattgatgt gtattttgga tttattttgt ttttattgta taaatttata 840

tttgatttta aagaaaagga aaatttaaag tttttttttt ttaaggggat agaaattttt 900

tgtgttaatt gtttgatttt tttttttgta aggttttatt ggaaattttt tgtaatataa 960

tgtaggggat ttttttatgt gttgatgttg tttatatagt ggggtgggtt tgattgaaga 1020

aaaaaaattg tatatatgta tgaaagatta tggttttatt tttggaaagt atgaaaggtg 1080

attgatattt ttaagaagtt tttgttattt aggaaaatta ttaaatattt tatttagaga 1140

tatttggaaa gattgaagga aaggaagaat gaagaaagta gaatttagat ttatgtgggg 1200

agagatttgt ggtagaggaa aagtattttt tttgaatttg ataagggatt tgtttggggg 1260

aattttttgt ttagtttttt attattaggg ttttttgaag ttgggttttt tattgggtag 1320

ttttttggga gtgtagtggg gaatttttat attttttttt taggtttttg aaggattttg 1380

tttttttagt gtttttttta ggttggtagg agttttgagt ttgatatttt tttttgatgg 1440

gataggtaag ttttgtgggt gtgtaaatat gttgtaatta agttttttgt tgattttata 1500

gttttgtgtg tttttgagaa gaagtgattg tatttaattg tttattgttg gtttgttttt 1560

taagagtttg ggggtttttt ttttttaatt tagaattagt tgtatggggg gtggggaaat 1620

gggggtgggg aaggagtggg agggtagtgg tttttgtgag tagagtgatg ttattgagtg 1680

agtttttgaa tggggagtgt tgttgttttt aagttgattg gtattttttg ttaggaagaa 1740

atgttaagag gtgggagtgt ttggggaggg aggtaggtgg tttttattgt aggtgtgggg 1800

agttgttttt ttgttttttt gtttgttttt taagtttgga tttttaggag tggttgaagt 1860

tgtggagtgt ttttggagtt tgtgaatgaa tttttttttt tttttttttt tttttttttg 1920

ttgagttttt tttttggttt tgatggtata gtgatataat gatgatgggt gttataattt 1980

gtatttgaat ttgtaggtga gttgttttga gtttttttgg ggaagaattt taggtgtgtg 2040

gatgtaatag ttgagaatat taggtgttgt ggataggagt tgggattaag atttttggtt 2100

agttttgtat ttttttgtat tttttagtat tgttttgtat tttttgtatt tttttttggg 2160

ttattatgtt ttttatgtga tttgtttggg taatgttgaa tttagttgtg tagtgttgta 2220

gtgaattttt tttttaaatt gtaataagtt gttttttaag gtaattatgt tttttttgtt 2280

ttttttttaa aaaataaaaa taaaaaattt atagaaaaaa atttgtgagt ttagaaaaaa 2340

gaagtaattg gtagaaggtt ttaattaagg taaagagttg taaggtgaag ttaagaaaat 2400

gtaggtattt aaaaaatgta ggtaattttt ataagggttt ttggggagag gtatatagag 2460

ggattttggt 2470

<210> 55
<211> 2470
<212> DNA

<213> Artificial Sequence
<220>

<223> chemically treated genomic DNA (Homo sapiens)
<400> 55



attaaggttt ttttgtatgt ttttttttaa aagtttttat gaaagttatt tgtatttttt 60

aagtgtttat atttttttaa ttttgtttta tagttttttg ttttaattaa agttttttat 120

taattgtttt ttttttttaa gtttgtgggt ttttttttat aagttttttg tttttgtttt 180

ttaagggggg aataaaagaa atgtgattat tttggaaggt ggtttattgt agtttggggg 240

gaaaatttat tgtagtgttg tgtgattggg tttggtgttg tttaggtggg ttatatagga 300

agtgtggtgg tttggggaag gatgtggagg gtgtgggatg gtgttggaag atgtgggagg 360

atgtggggtt ggttgaagat tttggtttta gtttttgttt ataatattta atgtttttgg 420

ttgttgtgtt tgtatgtttg gagttttttt ttagaaaggt ttggggtagt ttgtttgtaa 480

gtttaaatgt gggttgtgat atttattatt attatattat tgtattgtta gagttgagga 540

ggagatttag tgagaagaag gaggagggag aggaggaggg tttatttata ggttttaaaa 600

gtgttttgta gttttagtta tttttaagag tttaggtttg gaaagtaggt ggaggggtgg 660

aaaggtagtt ttttgtgttt gtggtaggga ttgtttgttt ttttttttag gtatttttat 720

tttttggtgt ttttttttga taagaagtat taattggttt ggggatagta gtgtttttta 780

tttagggatt tatttagtaa tattgttttg tttgtggaaa ttattgtttt tttatttttt 840

ttttattttt atttttttgt tttttgtgta gttagttttg ggttagggga aaggagtttt 900

taggttttta gggggtaggt tagtaataga taattgagta tgattatttt tttttgggag 960

tatataaaat tgtaaaatta gtaaagaatt tggttatagt gtgtttatgt gtttatagag 1020

tttgtttgtt ttattaaagg gaagtgttag gtttaaggtt tttgttaatt tgaaagagat 1080

attgagaaaa tgagattttt tggggattta gagggaaagt gtaagaattt tttattgtat 1140

ttttagggaa ttgtttaatg gggagtttgg ttttaaaaga ttttggtaat aaaaggttgg 1200

ataggaaatt tttttaggta aattttttgt tggatttaaa gagaatattt tttttttgtt 1260

ataaattttt ttttatataa gtttagattt tgtttttttt gttttttttt ttttttagtt 1320

tttttaagta tttttgagta gaatatttga taattttttt gagtaatagg gattttttgg 1380

aagtattaat tattttttat gttttttgga aataagatta taatttttta tgtgtatatg 1440

tgattttttt tttttagtta ggtttatttt attgtgtaaa tagtattaat atatggaaga 1500

gttttttgta ttgtgttata aaagattttt aataggattt tatagagaaa agggttaaat 1560

agttgatata aaggattttt gtttttttag aaaagaggga ttttggattt tttttttttt 1620

tgaagttaag tatgagttta tataatagga ataaaataaa tttaaggtgt atattagtat 1680

aatattaggg atattagaat ggatggtaaa tttttttttt tatataaata tgaaagtatt 1740

tttataattt ttttttttga agtttttatt tagaaaatta tattaaatat aggattttaa 1800

aatagtagtg attaaagatg aaagttaatt ttttttttta atttttttga ttagtttggt 1860

ttaaatttat gttttaaaat ttttagtaat ttagattttt gtatatgtgt atttataaga 1920

ttttttttat attttgtaat ttgtaggtat ttagtatggt tattgatttt aagtgataaa 1980

taggtagaag atttgttagg ataattttag tttttattat gtgtattttt tatttttttt 2040

gttatttatt agttttttgt agtaatataa ttgttagtta tattatttgg aaattaatgt 2100

gtttttgtag tgaattattt atttttagta aatttattgt attattattt taaatattga 2160

aatttgattt tattagatgt ttagaatgat agttatagag tagaattaga ttttaatttt 2220

tatgtataat atgtaaataa attaaggtgt agaaagttat tgttttaaag ttatataatt 2280

aatagtttag tggttttttt agtgttaatt atagtaattt tagatttaag attgtttgat 2340

ttaggaatgg agagaaataa aataaaatga atgtttattg aaatatatat tttagatatt 2400

aagatagatt tttttgtgaa atattttaag atgtggattt gtatttttat ttttaaattt 2460

gagtaaattt 2470



<210> 56 
<211> 2470 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 56 



aaaattgaat atttgagaat attggattta tatttttgag ggtaataatt tgttagagtt 60 

gagtagttgt tttttttagt tatggtatgg atattttaat ttgttatttt ttttattatt 120 

ttttattgtt ttatataagg ttagttttat ttatttgttt tatttgtttg gtttttgtag 180 

ttatttgaat ttgtaaatat tatttagatt ttttttgttt tttattaaag tttattttgt 240 

tttgttttaa gttttttatt agtgttttaa agtaagagtt aggtaatttg attaaatata 300 

ggttttttaa tatgtgtttt ttggagatag tattttaata gaaagagatg attttgtagg 360 

tagaaattat aggtattatt ggattataat taggatggta attgtttgtt ttattttttt 420 

tgttattttg atgtagttat attttattgt tttaaagtag attaaaggaa ttggttttat 480 

ggtagaataa tggatgttgg ttttatgtat gaaatttttt taagttgtat ttttatagat 540 

tattttagtt ttaaaagatt taaagttatt atgggttttt ttgatttttt tatttttttg 600 

gaggttttag gatatttggt ttgatggtat ttttttttat ttgtttttgg ggtttgtttt 660 

gtgttgggta gttgtgttta aggagtgtgt gtatttagta tgagtttatt ttttatgggt 720 

gtttttagag gtttggttgg gttgtgtata ttaagtgtag taagtttttg tattaatggt 780 

ttttgtggtt gggtggggtg tggttgttta aaggtgttgt ttttttggaa gttttgtttt 840 

tattttagtg tttgagagtt tgtagttggt tgggaaggtt tttttggttt gtttggtttt 900 

tttggggaag aggttaatat ttggtatatg tgatttatgg tagaggggag ttttgggtgt 960 

gtagaattgg ttgtgttttg ttgagagttt tttgtgggtg gggagagttt ttttattttt 1020 

ttgtttgttt gagtgtttag agtttagggt tgttgattgt agtatttttt gatttgttgt 1080 

attgagggtt tgtggttttt gtgtgtggtt tttattgttg ttggaggtgg ttgtaggtga 1140 

attgttgggt tgttagttat gatttgagtt aggtattttt ttgtttttgg gattggggtt 1200 

gaaggttaat tatattgtag ttaggttttg tgattggatg gtagtgagag ttgattggtt 1260 

gttgttgtga gttttgggtt tttttttttt tttttttgtt ttaattttag tgtttaagtt 1320 

ttaggttaat gagatagtgt tttatagatg tttttttttt gttttttttt ttttattttg 1380 

gagagggagg ggaagttttg attggttaga ttgtttttgg aagttttttt ttttttatta 1440 

attattgaag gaggtatgtt tttagtggag ttttttagtt aattataaag tggtggtggt 1500 

gtttagttgt gtagttttat tagtgttttt gggttttatt gtttttaatt ttttaagttt 1560 

ttttgtaagg gttggtgatt ttgattggtt attgttgtta ttgttgaatg ttttttttag 1620 

ttagttattg aataaggtga atttattttt tttgtttggt tatttttggt attttttgtt 1680 

ttttgggtgt ggttttttag tgttattttt taattggttt tttaggttga ttggtttttt 1740 

ttgggaggag ttgtaaataa atgatgtttg tgattggttt tgtttgggtt ttggttttta 1800 

gttgaagtgg gtgagtgtgg ggtttggttg gtgattttta gatgtttgtt atgtgggtgg 1860 

tggggtgttg ggtggtgtaa ggttgggtgg gggaggaagg aggtggagga tgagtaggag 1920 

gggggaggag gagtggggaa gtgtaaggtg gttgtgtaga tagtgttttt tatatagagt 1980 

agtttttgat ttgggtgaat gtgggtttgt gttgttgttg ttgttgttgt tgtttgggtt 2040 

aagtgataaa ggaaggaagg aagtgaggag gagttggttt tgtagttgtt gatagggttt 2100 

tgggttgggg taaagtgtgg atattttttg agtgggtatt gagtagagtt gaggggtggg 2160 

agggtggttg agttgttgtt gtggatgggg gagggggttt tgagggatgg aagtggttgt 2220 

tgggttttta tgtttttggt gaatggggaa tagttgagga gttgttgttt ggggtttgaa 2280 

gggagttgtt tttgttattg ttatggttgt tggatttagt tgttgtttgt agttgttttt 2340 

ggtgtaatga ggagaggagt tgttgttatt gttattgttt gtttttgatt gatttgtgat 2400 

tttgttgttt tttagtttgt tgggtttttg ttgttagttt gttggatttt gtggtttgtt 2460 

ggagttgtag 2470 

<210> 57 
<211> 2470 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 57 

ttgtagtttt ggtaagttgt gggatttggt gggttgatgg taggggtttg gtgaattaga 60 

gggtggtgga gttgtgagtt agttagaggt gggtggtggt ggtggtggtg gttttttttt 120 

ttattgtgtt aggagtagtt gtaggtggtg gttggattta gtggttatgg tggtggtgga 180 

ggtagttttt tttagatttt aggtagtggt tttttgattg ttttttattt gttggggata 240 

tgggaatttg gtaattgttt ttgttttttg gggttttttt ttttgtttgt ggtaatagtt 300

tggttgtttt tttgtttttt ggttttgttt ggtgtttgtt taggaagtgt ttgtgttttg 360 

ttttagttta gagttttgtt agtggttgtg gggttggttt ttttttgttt tttttttttt 420 

tttgttattt ggtttgggtg gtggtggtgg tggtggtggt ataagtttgt atttgtttgg 480 

gttaggagtt gttttgtgtg aggagtgttg tttgtgtagt tgttttgtat ttttttattt 540 

tttttttttt tttttatttg ttttttattt tttttttttt ttatttagtt ttatattgtt 600 



tagtgttttg ttgtttgtgt aataggtgtt tgggaattgt tggttgagtt ttatgtttgt 660 

ttgttttggt tgggagttga agtttgaatg gagttaatta tggatgttgt ttgtttgtgg 720 

ttttttttgg aaaaagttga ttagtttgag aaattaatta gaaagtgatg ttgggaagtt 780 

atgtttaaag agtggaaaat gttgaaggta gttgaatgga aagggtgaat ttgttttgtt 840 

tggtggttgg ttggaggaga tatttgataa tggtaatagt ggttaattag agttgttggt 900 

ttttatgaag aggtttgaag agttgaggat ggtgaaattt agagatgttg gtgaaattgt 960 

atggttgggt gttgttgttg ttttgtgatt ggttggaaaa ttttgttggg agtgtatttt 1020 

ttttggtgat tggtgaagag aagggggttt ttgaggatag tttggttagt taggattttt 1080 

tttttttttt taaggtgggg agaagaaagt gaagggaggg tgtttataaa gtgttgtttt 1140 

attggtttga agtttaggtg ttggagttgg ggtaaaaggg agagggaggg agtttgaaat 1200 

ttgtggtggt ggttaattgg tttttattgt tgtttaattg taagatttag ttgtagtgtg 1260 

attggttttt ggttttggtt ttaggagtgg ggagatgttt gatttgggtt gtggttgatg 1320 

atttgatggt ttgtttgtgg ttgtttttaa taatggtgga aattgtatat agggattgtg 1380 

ggtttttggt gtagtaaatt ggaaagtgtt gtggttggtg gttttgggtt ttgagtgttt 1440 

aagtgagtag aggggtggag gggttttttt tatttatagg aggtttttgg tggggtgtag 1500 

ttaattttgt gtgtttaagg tttttttttg ttgtggattg tgtgtgttga gtgttggttt 1560 

tttttttgaa agggttagat ggattaaggg ggttttttta gttggttgta agtttttaga 1620 

tgttaaaatg ggaataaagt ttttagaaag gtagtgtttt tgggtggttg tgttttgttt 1680 

agttgtgggg gttattgata tagaggtttg ttgtgtttgg tgtgtgtggt ttggttgggt 1740 

ttttggaaat gtttatgaga aatgagttta tgttggatgt gtgtgttttt tgggtgtaat 1800 

tgtttagtgt aaaatgggtt ttgaaagtag gtgaggggaa atgttgttag gttggatatt 1860 

ttgaagtttt tgaggaaatg ggggggttag aggagtttgt ggtagttttg gattttttgg 1920 

aattagggtg atttataaaa gtgtagtttg agaagatttt atatataaga ttagtgttta 1980 

ttgttttatt atagaattag tttttttggt ttgttttaga atagtgaggt atggttatgt 2040 

taagataata gagaaagtag aatagatagt tgttattttg attgtaattt agtggtgttt 2100 

gtaattttta tttgtagagt tatttttttt tgttaagatg ttgtttttag aaagtatatg 2160 

ttaaaggatt tatatttgat tagattattt ggtttttgtt ttgaaatatt gatgaaggat 2220 

ttgggataga ataaagtaag ttttgatgga agataggaaa ggtttagata gtgtttgtag 2280 

atttaaatgg ttatagaggt tagataagta aagtaaatga gtgagattgg ttttgtgtaa 2340 

gataatgggg aatgataggg aaaatggtaa attggaatat ttatgttata attaaaagga 2400 

atagttgttt aattttagta gattgttgtt tttaggaatg tgggtttagt gtttttagat 2460 

gtttaatttt 2470 

<210> 58 
<211> 4721 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 58 

atgtgaagtt tagtattttt ttaagttttt gtttatatat aaaaagttat gttttagagg 60 

ttttattttt aatttaagat aagtaggata ttttaaaaat tatttattta ttagttggtt 120 

tgagaaaatt tattgggtat ttggagtaag gagggggggg agattttgaa attaaaaaag 180 

taagagaaaa taaaggttta gttatgagaa gagagataga aatttatgat aatagaggta 240 

tttttttagt ttaattgttt gttttagttg tgggggttgg aggtagggtt tgaggtggtg 300 

attgttttga tatttagttg ttagataaat aaaagaggtt gtttggtgtt agttttggtg 360 

ttttgttttt ttatgtttgt ggttgtagtt gttgtgtttg gttggtttgg ttggaaatta 420 

ttggtttggt tggagttgtg tattttatag tttatagggt ggggggtgat tatgtttttt 480 

ttttgttttt ggattttaga tttagttttt agtaattttg gttttttgtt ttggtttgtt 540 

ttttaggtgt gtaatttatt tttgtttttg ttggtttgtt tggtatttgt gtgggggata 600 

tggggttgtt tatttttagt gtatttggga gatgtatttt aagtgtgttg gggaatgtgg 660 

gggatttgtg ggtgtttggg gtgaattttt atattatatt ttgtgtatag agatgtagag 720 

taaaggaagg aatggagaaa gttgagagga taaaaggaaa ggagttttag ggttttagtt 780

gagaagttta ggttttagtg tagtaagagg tgataatgtt attgaggtgt tggggtttta 840 

ttaatttata gatttatttg tgtggaagga aggaaaaagt aggggatatt taattttttt 900 

tgtttgtttg tttttttgta ttttggggtt ggtagagttg aaaggagata gatatttagt 960 

ttggtttttt atttaatttt ttttgtttta aaattttaaa ggtgtgtggt tatgatagta 1020 

tttgggttat tggtgtttga tgtgattttt tgtatttatt aaaggagtaa atatggagat 1080 

tttttagttt ttatagtatg ttttttttag ttttttttgg tgggtttgga ttaaatttaa 1140 

tttggttata tattttaatg tttttttggg tgtggggagg tgtgattttt ttattttttt 1200 

ttgtattttt ttgttgattt ggaaatgaag gtgtttaatt tttaattgta gattgtagga 1260 

gtagtgtttg tagatttttt tggggtaata taggagaaaa tttggaaggg aatttttagg 1320 

gaatgttttt tgagtttttt tttgtttttt ttttattttt agtttttagt ttttttttag 1380 

gagggtatgt atattttttt aggagggtat gtatattttt ttaggagggt atgtatattt 1440 

tttttggagg gatggttgta taaatgtatt gttttagggg tttttttttt agtattggtg 1500 

ttttggaaaa tgttgggggt agggggataa ttttttattt agtttgtaga gttggttaag 1560 

gtgatggggg ttgtagtata ttagtttggg gttgattttt ttgtaagatt tatagtaata 1620 

tagttttttg atttttgagg aatgtttgag atttggttat ttgaatattt tgtatagtag 1680 



gagggggagt gagtaggatg agggtggttg tgtaggttgt gtgttggtgg gtgtttttgg 1740 

gtttagtttg tagtttaggt ggttgggtag taggaggtat gtggtgtgtg ttttgttgtg 1800 

ggattttatg gtttggagta gtagttttag ttgttgtttt tttagttttt tgagtattga 1860 

gtgtgtgagt gtttttagat ttgttttggt gtttttggtt gtgttggtgt ttgtggttgg 1920 

tgggtgggga tggtggtgat ttttggtatt ttgtattgtt ttgtttaggt agtatttagt 1980 

tttgtttggg ttgttgttat tggttttatg tgtttggttg tttgttgttt tttttttttg 2040

tagtttgttt ttttttttat ttttttttgt gttttttttt ttgtttttgt tgttgggtgt 2100 

atggtttttt tagagatgtt ggatgagtgt agattgtttg gttttgaata tgtggggtga 2160 

ggaggtgagg agaaaagttg tttgtttgtt aaggagtgaa tatgattttt gtatattatg 2220 

aagaagttgg gtgttgagtt ggggtagtag gtgtaggtga tagtagtagt agtaggggtt 2280 

tgggtaggag tggtggtggt ttgaggggtg ttttgtggta tgtgttagtt ttttggaggt 2340 

tggggtgtgt gtgggggttt gggggtgttt gttggggatt gggggtttgt gttttggttg 2400 

ttttattttg tgtggtttgt gttttgtgtg gttttttggt tttggtgtgt tttggaggaa 2460 

tttggttgtt gtttttttgg ggatggttga gtttgttttt gttggtgttt ttttaaaaag 2520 

aggttttttt gtagtggttt ttgaatgttg ggtttgttag ttttggtttt ttatatggaa 2580 

ggtttgtggg ggtaaaaaat gaaaggtgtt tggttgggtt gttggaagaa ggaaaaagtt 2640

tttttttttt ttgttaagta atttaatttg ggggtgggga gaagtaggta attaaaaaaa 2700 

aaaaagtaag tgatttattt ttttttttta tatttttagt aattggattt ttttgaattt 2760 

tgtgtatatg aagatttagg ggagggggtt gagtggattt tattttgtat gagatgtttg 2820 

gtaaaataag aaggtttttg taaaatttaa taattaaata tgtaaagttt taaatgaaaa 2880

ttattatttt tttgaatttt agaggtttgg gggtgtttgg ttggaattgg ggtttaaaaa 2940 

aagaaaatgt ttataaagta taataagatg tttgatgggt ggaaaaatgt atttatgagt 3000 

tatatttttt tgtttttttg taaagttttg ttggtttttt tttttttttt ttttgttaaa 3060 

aaaaaaaaaa aaaattgtgt atttttttaa tttatagaaa gttttggtta gattgtttta 3120 

attttgtgta tttgggtggt ttaggggagt ttttggtttt tttttttgtg tttttggggg 3180 

tttaggtttt tggtggggat ttttttgagg ttggtgtggg tgtaggggta gaagatgttg 3240 

tggtggtggt tgagtttggt ggggttgata gtgtggggga gggtggtgtg gtggtggtgg 3300 

agggttttag atgggttgtg tgtttttgtt ttttttgggt ataagttgtt tgttagtgta 3360 

ggggttgttg atgttttttt ttttggttgt ggttggttgt tgaggttttt tgtatgggtt 3420 

gtttgttttg atttagttgt ggtggtagga ggttttggtg tttttttggt gttttttttt 3480 

ttgagatttt ttttgttgtg tttgggagtt tttttgtttt tggtttgttt ttttttggtg 3540 

ttttggtttt tttgttgttg ttaggggttt gttgtgttgt atttaggggt ttaggggttg 3600 

ggtgtttggt ggtttggtgt tgatggattg gttttgtttt gggtagtttt tttttgtgtg 3660 

gtgagtggat tgagtatggt gtttggttgg tttggttggt gtggtttggg gataggattt 3720 

tttgtgtgtt gagtagtaag tgagtgtgtt tggggtttat tgttttttgt aaggtttttt 3780

gtttttgttt tttttttttt tttttttttt tttttttttt ttttgttttt agttgttgta 3840 

gttgtgttgt tttttttttg tttttttgta tttttttttt ggttttttgt ttggttggtt 3900 

ttattgtgta gtggtgtgtt tggttttggt tttgttatgt ggttgtttag atattttgtt 3960 

gttttaaaaa aaaaaaaaaa gtgattgtgt tttgtaaata atagattggg tttttaaaag 4020 

ttattttttt ttttaatttt ttgttattgt tatttttttt tgggtttgta gagggggtat 4080 

tgatggaggg gagagagata ggtggggggt agagaagttt ttagaatgga ttgagttttg 4140 

gttggagtta tggagaaatt ggaaaagtag ggagtattga gtgggttttg ttgtgagttt 4200 

tggagttgag tgagtgggtt ggtggtttga ttttgatttg gttgggtttt gtggtggtta 4260 

ttttgtgtgt gtttgtttta gtgttttatt tattggtttt gttttaaagg ttttggtggt 4320 

ggattttttg gttgtgtttg agggtaaggg gaggagagtg ttgttttggt ttaaaagata 4380 

tttatattgg attggattga ggttttggga agtgtgtgtt gaggggaata gttgttgagg 4440 

gtggggaggt ggtgtgaata tgattttagt ggtggttgtg tgtttttttt tgttttttta 4500 

gttttgggtt ttggttttta ggattgtttg gagaagtgtg ttttgtgtat agttttggaa 4560 

tgtatttggt tggttgatga gttgtgaggg gtagtatttt ggtgggagaa ggggagtggg 4620 

ggtgggggtt tgtttgtgtg ttgtgggtag gttttttttt gggtttggaa gatttttgtt 4680 

atttgttatt ttgttttttg gtgtgggaag gttatttagt g 4721 

<210> 59 
<211> 4721 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 59 

tgttgggtaa tttttttgtg ttgggaggta gggtggtggg tggtggaggt tttttgggtt 60 

tgggaggaaa tttgtttgtg gtgtgtaggt gagtttttat ttttgttttt ttttttttgt 120 

tgggatgttg ttttttatag tttgttagtt ggttaaatgt attttagagt tgtgtataag 180 

atatattttt ttaggtagtt ttagaaattg gagtttggag ttgagagggt gggagggagt 240 

gtgtggttgt tgttgaggtt atatttatgt tgtttttttg tttttggtgg ttgttttttt 300

tagtgtatat tttttagggt tttggtttag tttggtataa atgtttttta aattgagata 360 

gtgttttttt ttttttgttt ttgggtgtgg ttaagggatt tattgttagg gtttttaaaa 420

taaaattaat gaatgaagtg ttagggtgag tgtgtgtgag atggttattg tgaaatttag 480 

ttgggttgaa attgggttat tgatttgttt gtttagtttt aaaatttgtg gtgaagtttg 540 



tttggtgttt tttgtttttt taattttttt atggttttgg ttggggttta atttattttg 600 

ggagtttttt tgttttttat ttattttttt tttttttatt ggtatttttt ttatagattt 660 

gggagggggt ggtggtggtg gggggttggg gggagaaata gtttttagaa atttgatttg 720 

ttgtttgtga aatataattg tttttttttt ttttttaaag tgatagggtg tttagatggt 780

tatgtgatga ggttggagtt gggtgtgtta ttgtgtagtg gaattagttg agtagagggt 840 

tggagggggg gtgtgggggg gtggggagga ggtggtgtgg ttgtggtggt tgggggtggg 900 

ggagggaagg gggaggaagg gggagggaag ggggtggggg tgggaggttt tgtgggaggt 960 

ggtgagtttt gggtatattt gtttgttgtt tggtgtatgg aagattttgt ttttgagttg 1020

tgttagttga gttagttggg tgttgtgttt ggtttgtttg ttgtgtggga gagagttgtt 1080 

tgagatagag ttagtttgtt ggtgttgagt tgttgagtgt ttggtttttg agtttttgag 1140 

tgtggtgtgg tgagtttttg gtggtggtag aaggattgga gtgttaggag agggtggatt 1200

ggggataagg aggtttttgg gtgtgatgag gagagttttg gaggaggagg tgttgagagg 1260 

atattggggt tttttgttgt tgtagttggg ttgaggtgag tagtttatgt ggggagtttt 1320 

ggtggttagt tgtggttagg ggaagggata ttggtggttt ttgtattagt aagtagtttg 1380 

tgtttggggg gggtgagaat gtgtgatttg tttggggttt tttgttgttg ttgtgttatt 1440 

tttttttgtg ttgttagttt tgttgggttt agttgttgtt gtagtatttt ttgtttttgt 1500 

gtttgtgtta gttttgagga agtttttgtt gaggatttgg gtttttagga gtgtaggggg 1560 

aaagattaga gattttttta aattatttag atgtgtagga ttgaagtggt ttagttaaag 1620 

ttttttgtgg attaaaaaaa tatatgattt tttttttttt ttttggtaga agaaaaggag 1680 

aggaagatta gtggggtttt gtaaggaaat ggggggatgt aatttgtgga tatatttttt 1740 

tatttattaa atattttgtt atattttgta aatatttttt ttttttaaat tttagtttta 1800 

gttggatgtt tttagatttt tgaggtttga ggaggtggtg gtttttattt ggggttttgt 1860 

atatttggtt gttaggtttt gtgagagttt ttttattttg ttagatgttt tatgtggggt 1920 

gaagtttatt tggttttttt ttttgagttt ttgtgtgtgt ggaatttgag gagatttggt 1980 

tattaaggat atagaggaaa aaaataaatt gtttgttttt ttttttttaa ttgtttgttt 2040 

ttttttattt ttaaattaag ttgtttagta agggggaaag aggttttttt ttttttttaa 2100 

tagtttagtt gaatgttttt tgttttttgt ttttgtggat tttttatgta ggaagttgag 2160 

gttggtgagt ttgatatttg ggagttattg tgggggggtt tttttttggg gaggtgttga 2220 

tgggggtagg tttggttgtt tttagggaag tggtggttgg gtttttttgg ggtgtgttgg 2280 

ggttggagag ttgtgtaggg tgtgggttgt gtggggtggg gtagttggag tgtaggtttt 2340 

tgatttttgg tgggtgtttt tgggtttttg tgtgtgtttt ggtttttggg agattggtgt 2400 

atgttatgga gtgttttttg ggttgttgtt gtttttgttt gggtttttgt tgttgttgtt 2460 

gttgtttgtg tttgttgttt taatttggtg tttgattttt ttatggtgtg tggaggttat 2520 

gtttgttttt tagtaggtaa atgatttttt tttttgtttt tttgttttgt atgtttagga 2580 

ttaaatgatt tgtgtttgtt tggtgttttt ggaggagttg tgtgtttggt ggtgaggatg 2640 

aggaggaggg tgtaggggga ggtggaggag gaggtgagtt gtggggagaa ggggtgatgg 2700 

atagttgagt gtatggggtt ggtggtggtg gtttgggtag ggttggatgt tgtttgggta 2760 

aggtggtgtg aggtgttaaa ggttattatt atttttattt gttagttgtg ggtgttggtg 2820

tggttggggg tgttgaggtg gatttgaagg tgtttatgta tttggtgttt aagaaattga 2880 

aggagtggta gttggagttg ttgttttagg ttgtggagtt ttgtggtggg atgtgtattg 2940 

tgtgtttttt gttgtttggt tgtttggatt gtaggttggg tttgggggtg tttgttggtg 3000 

tgtagtttgt gtagttgttt ttgttttatt tgtttttttt tttgttgtgt aaagtgttta 3060 

ggtggttgga ttttaggtat tttttggaag ttaagaggtt gtgttgttgt gaattttatg 3120 

ggaagattaa ttttgagttg gtgtgttgta atttttatta ttttagttga ttttgtgaat 3180 

taggtgagaa gttgtttttt tatttttaat attttttgga gtgttagtgt tggaagaaaa 3240 

gtttttaggg tagtgtgttt gtgtagttat ttttttaggg ggagtgtgta tatttttttg 3300 

ggagagtgtg tatatttttt tgggagagtg tgtatatttt tttgggaaga gattgggggt 3360 

tgaaagtaga gaagggataa ggggaaattt aaagggtatt ttttgggggt ttttttttag 3420

gttttttttt gtgttgtttt gaaggggttt atgggtattg tttttgtagt ttgtggttaa 3480 

aaattagatg tttttatttt tggattaata agggggtgta gagggaggtg ggggggttat 3540 

gtttttttgt atttggaaag atgttgaggt gtgtggttga gttggatttg gtttaggttt 3600 

gttaaggaag gttgggagag gtgtgttata aaggttggaa ggtttttgta tttatttttt 3660 

tagtaaatat ggagaattat gttgaatatt agtggtttag atattgttgt ggttgtgtat 3720 

ttttggagtt ttggggtaaa gagagttgga tggaaggttg aattgggtat ttgttttttt 3780 

ttaattttgt tagttttagg gtgtagagga ataaataggt aggagaagtt gggtgttttt 3840 

tatttttttt ttttttttgt atagataagt ttgtagatta gtggggtttt ggtgttttag 3900 

tgatgttgtt gttttttatt gtgttggggt ttgggttttt tggttgagat tttgaaattt 3960 

tttttttttt atttttttag ttttttttat tttttttttt attttatgtt tttgtatata 4020 

gagtgtgatg tgagggttta ttttgggtgt ttgtagattt tttatgtttt ttagtatgtt 4080 

tgggatgtat tttttgagtg tgttaggaat gggtagtttt gtgttttttg tgtgagtgtt 4140 

gagtggattg gtgggggtag aggtgaattg tatgtttggg gggtgggttg gggtggggag 4200 

ttgaggttgt tgggagttgg gtttggagtt tgaggatggg gagggggtat ggttgttttt 4260 

tgttttgtga attgtgaagt gtgtggtttt ggttgagttg gtggttttta gttgggttag 4320 

ttgagtgtgg tggttgtggt tgtagatgtg gaggagtgga gtgttgaggt tggtgttagg 4380

tggttttttt tgtttatttg atagttaaat attaaggtaa ttattgtttt gggttttgtt 4440 

tttaattttt atagttaaga taaatagttg ggttaaaaga atgtttttgt tattataagt 4500 

ttttattttt ttttttatgg ttgggttttt attttttttt gtttttttaa ttttagagtt 4560 

tttttttttt ttttatttta gatatttgat ggattttttt aaattaattg gtgagtagat 4620 

gatttttaaa atattttgtt tattttaagt tagaaataga gtttttagaa tgtggttttt 4680 

tgtgtgtagg taaaggtttg gggaggtgtt gggttttatg t 4721 



<210> 60 
<211> 2494 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 60 

tttttatgaa gttataaagt tgtaattttg tttttatgtg tagtattata ggttttgttt 60 

ttaaatattt gtaggtgtat ttttttgtat ttgttaaatt aagggttttt tttttttgtg 120 

ttagatttgt ttttatgatt atgtttttta tttgatttat agaaagggtt ttatgattgt 180 

taggttattg ttgtttttgt attttattgg ggagttgaag gttttagtgt tgtttgtgat 240 

ttttttaagg tggttgggaa ggttttaatt atgttttttt gtaaattaga taaggaaggt 300 

ttggggattt tggttgtgga aaggtataga ggaggtgaat gttttggatg tttaagtgat 360 

gtatttgtag gtggatttgg aaatttgtgt ttttgagttg gaggtggagt ttttagattt 420 

agattgattt tttagaattg ggtggttttg tttgaggtat gaaggttttt taggtttgtt 480 

tttaaaagtt gtgtataatg gagatttttg gaataggtat atatttttat tgtaattaga 540 

atgagatgta atatttaaga tgtagggtag ggttttggta aagatatgtt gggggaatgt 600 

gaaagaaagg gttaaaatta tgtagttggt ttttagagat taaagattat gatttttagt 660 

atttaatgta gtgaattttt aggatgttgt atgttgggag ttgtagtttt ttattattgt 720 

tatttttttg tttttttgtt atttgaaagg gttttagggg atttgtaatt ttttttgtgt 780

gttttttttt tttgagaagt gttgttatat gagaaagttg gttgtgaaag ttgtgttgga 840 

attattttta atgaaatttt aggtatagat gggaaagggt gaagaatatg ttgttatggt 900 

tattgttttt ttggttatgg aataaatgtt ttttaggatt tggaagtagt tttgttgtga 960 

tttttttaaa aggatggatg tgttttttgt ttatatttat tggatgtttt tttttagagg 1020

ttaaggttgt ttaggtaaag gggtggtttt atgtgtgagg ggtttgtgga gttatttgat 1080 

tggagaaaag ttgtaaattt tgattaattg gaaggagtta tgttttgggt attggttatt 1140 

gtatttggat agttttgatt ggtggatttt tgtttttttt atgaattttt attgggtgtt 1200

gtgggtgtgt ggtgtggtgt gattggtggg tttatgtttt ttgttttttg tttgtgagaa 1260

gtgggggtga aaagtggttt gatttgtttg gggtgtagtg ggtggattgt gtggttggag 1320

gtgtgaggat ttgaatttag gggtggaggt ggtttttgtg attgaagggg atttgagatt 1380 

tattggttgt atgttatgag ggttttgtgg gtgttgggtt tttgttgtgt tttgttgatt 1440 

tttggtgagt gattttggag gagtagatgt ttttttttta tatatgtggt tgttttttga 1500 

aggttttggg ggtgttgaat gtgggagggg ggattttggg gtttgtggtg ggttaaggga 1560 

tgttattatt ttttgaaaga gggtaaggga attatgttta tttttttttt tttttggaaa 1620 

taaaaaaaga gagtaatatt atttaattgt tttatagatt taatagtatt ttgtttggag 1680 

gttttagtga ttttggggtg gggttttttt gaggttttgg tttttttggg agttgtgtga 1740 

gttttttttt tttttttgtt tggaggattt ttttggttat tgtaattttt ggttatttta 1800 

gtttttttgt tttttaattt tttagatttt gattttttgt aagtttgtgt ttttttttag 1860 

atgttggttt tgggagtgtg aggtttgggg ttagtggaat ggtagtgtta gtgtgtttga 1920 

tttagttttg ttggagtttg gtttttaata tttttttttt tttttatgtt tttgggttta 1980 

attttagttt tttttgttag aggttgtata aattttattt gtattttgat ttgtgttaat 2040 

gaaggttgaa gttttggttt gattagttgg gaggttttgg tttggtttgt gtgtaatatg 2100 

tgtgggtgta gaagaggttt ggtattagtt tgtttaatgt tatttataaa tgattggaga 2160 

tatattgttt agaagtggtg attgtttttg tattttagga tttattggag attttagtat 2220 

ttgaatttaa atagatgttt tgttttggtt ttatttttgg tatttaaatt agggaatgta 2280

tttttttatt tttatttttt ttagttatta attgaatttt gtaattttta ttttttttta 2340 

tttgattatt ttttttggag aagtgtgatg ttgaagagtt tgtttgtgtt aaatttttag 2400 

ggttggttag agttgatgat gaagttgatg tggatggtat agtagaagag gatttgggta 2460 

aaagtagaga aggattaagg atggatgatg aagt 2494 

<210> 61 
<211> 2494 
<212> DNA 

<213> Artificial Sequence
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 61 



attttattat ttgtttttga ttttttttta tttttattta gatttttttt tattgtatta 60

tttatattaa ttttattgtt agttttgatt gattttgaag atttaatatg ggtaaatttt 120

ttaatattat atttttttaa ggaggatgat taaataggga agggtagggg ttgtaaaatt 180

tagttggtaa ttaagggagg tggggatgaa agagtgtatt ttttggttta aatattaaag 240

gtaaagttaa gatagaatat ttatttaagt ttagatgttg aggtttttag tagattttga 300

aatatggagg tagttattat ttttaaatag tgtgtttttg attatttata aataatgtta 360

agtagattga tattaggttt tttttatatt tatatgtgtt gtgtgtaggt tgagttaagg 420

ttttttaatt gattaagtta aaattttggt ttttgttagt ataggttaga atgtaaatag 480
480 



gatttatgta atttttggtg aggggaattg aaattgagtt tagaggtatg agaaggggga 540 

gaaatgttaa gaattgaatt ttagtaaaat tgaattaaat atgttgatat tgttattttg 600 

ttggttttag gttttatgtt tttgaggtta gtatttaaga agggatatgg gtttgtgagg 660 

agttggagtt tggggaatta ggggatggga aagttgggat ggttgaaggt tgtggtggtt 720 

gaaaaagttt tttgggtggg gaggggggag gggtttgtat agtttttggg ggagttagag 780 

ttttagagag gttttatttt gaggttgttg agatttttgg gtgagatatt attagatttg 840 

tagggtagtt agatgatgtt gttttttttt tttattttta gaggaagaag agaataaatg 900 

taattttttt gttttttttt gaggaatggt gatgtttttt ggtttattgt aggttttggg 960 

attttttttt ttatgtttaa tgtttttagg atttttgaga agtggttgtg tgtgtggagg 1020 

ggggatgttt gtttttttag aattatttat tgaaggttag taggatgtag tagaggttta 1080 

gtatttatag ggtttttatg gtgtgtggtt ggtgagtttt aagttttttt tgattgtagg 1140 

agttgttttt atttttgggt ttggattttt atatttttag ttgtgtggtt tgtttattat 1200 

attttaagta ggttgggttg ttttttattt ttattttttg tgggtggggg atgggaaata 1260 

tgaatttatt aattgtgttg tattatgtat ttatggtatt taatgaggat ttgtgagggg 1320 

ggtggaagtt tattaattgg agttgtttag gtgtggtgat tgatgtttga agtgtggttt 1380 

tttttgattg gttagggttt gtagtttttt tttaattaaa tggttttgtg ggttttttat 1440 

atgtgggatt gtttttttgt ttgggtggtt ttggttttta agggaaaatg tttaatgaat 1500 

gtaagtagag aatatattta tttttttaga gaggttgtgg tggaattatt tttggatttt 1560 

agagagtgtt tattttgtga ttggggaaat ggtagttatg gtaatgtgtt ttttattttt 1620 

ttttatttat gtttggggtt ttgttggaag tgattttagt atgatttttg tggttagttt 1680 

ttttgtgtgg tggtgttttt tggaaagaag aaatatatga gagaggttgt aaatttttta 1740 

gaattttttt agatggtggg ggggtggggg ggtggtggtg gtgagaaatt atagtttttg 1800 

gtatgtaatg ttttaggggt ttattgtatt gaatgttggg ggttgtagtt tttggttttt 1860 

gaggattagt tgtgtagttt tggttttttt ttttatgttt ttttagtgtg tttttgttag 1920 

agttttgttt tgtgttttag atgttatatt ttgttttgat tatagtagag atatgtgttt 1980 

attttaggag tttttattgt atatagtttt tagaagtggg tttagaaagt ttttatgttt 2040 

tgggtaaggt tgtttggttt tggggggttg gtttgagttt gggggttttg tttttggttt 2100 

aagagtatga gtttttgaat ttgtttataa gtgtgttatt tgggtatttg gagtgtttat 2160 

tttttttgtg tttttttgtg gttggaattt ttaggttttt tttatttggt ttatagaaga 2220 

atgtggttgg gatttttttg attattttag ggaaattgtg agtagtgttg aggtttttga 2280 

ttttttaata aagtataaag gtggtggtga tttgatggtt gtagaatttt ttttgtgaat 2340 

taggtggaga gtgtggttgt aaaagtgggt ttagtataga aaaggaaaat ttttagttta 2400 

atgagtgtag gaggatgtat ttgtagatat ttgaaagtaa aatttgtggt gttgtatatg 2460 

agaatgggat tgtagttttg taattttata gaaa 2494 

<210> 62 
<211> 2497 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 62 

tgaagattgt gtggagtggg aggtttgtgt ttattttttt tatttattgg ggaatgggtt 60 

gagtggttgg gggtaaagtg ggatttaggg gttttgttta gtttttattg taggttaagt 120 

ttttttttgt ggttaagagt tattgggttt tttatttttt tttggttttt atttgtagat 180 

ttagatggtt ttatggtgat ttagaggagg atgtatttgg gtgttttttt tgaggatggt 240 

tgtagggatt atttagggat tgagttgagg tgtagatata gttatggatt tgggtgttgt 300 

tagatataga agggtgtgta gtaaagtgag ttttatgtat gtgtagattt tttttttggt 360 

ttttgttttt tatggaaata taattgatta tattatgatt gattatggta gttttttagg 420 

gttttttttt tttttttttt tttttttttt agaagttata aagggatata gaatatattt 480 

aaagattatt ttatgttttt agtggagagg taaaaaaggt ttgttttgga atggggaagt 540 

ggttgtgttt tggatggaga ggggtgggga ttttgtgatt gtaggtggag ggagggtggg 600 

tgttgttggt gtaggtggtg atagggagat attgttgtta ttgagtattt ttatgtaagt 660 

ttttttattt ttttgtgtat tagtgtttat attggggtaa tgataaatgt tgtgttgaaa 720 

aattatttga tggggttatg gaaggaatgg aaggaatggt gttttggttt gtttggggtt 780 

tgtgtatgtt gttattaagt tgtgggggtg ggttggaggg gagagttgtg ttagttaggt 840 

tgttgttaga tgatgagttt ggggtgtgat ggggtggagt atttttaaaa aagtgtatgt 900 

ttaggatttt gtttagtgta tttttgtgtg tggtgggttg ggttgggtag ttttataaat 960 

agttgtggtg tgagttttga agggaattat tagtgttttt tgtttatgga gttttaggtt 1020

tataatggta gtggttgtta gttttgtgtt ttttttgtgt tttttgtttt tgtatttgtt 1080 

gtttggttgg ttttgggtag gatgggttgg tgagtttggg gatgtagttt aagtagggtg 1140 

ggggttaaat ttgggaggtt gtggattgta gtgggtttta gaggagggga ggtttttgga 1200 

aggattggtg tgattttttt gaatgaatat tgtggttttt ttgaatgttg tggttttttt 1260 

gaatgttgtg gtttttttga atattgtggt gtttttgaat attgttgttt ttttgaatat 1320 

tgtgattttt ttgaatattg tgattttttt agatatgttt agttgaaggt atttagtttt 1380 

ttttggtggt ttttttttgt tgggtttgtt ttttgtggtt gttgtttgtt ttttaggtta 1440 

ggaggttttt ggaaggattg gtgttgtttt tttgaatatt gtggtttttt tgaatattgt 1500 

ggtttttttg aatattgttt tttttttgag taatgtgatt tttttgaata ttgtggtttt 1560 



tttgaaaatt gtgatttttt tgaatattgt tattttattg aatattgtga ttttgttgaa 1620

tatgtttggt tgaaggtatt tagttttttt ttgtggtttt tttttgttgg gtttgatttt 1680

tgtggttgtt gtttgttttg taggttagga ggtttttggt agtattggtg tgatgttttt 1740

gaatattgtg ttttatttta atattgtgat ttttttgaat attgtgattt tttttgaata 1800

ttgttgtttt tttgagtaat gtggtttttt tgaatattgt ggtttttttg aatattgtgg 1860

tatttttgaa tattgttgtt tttttgtata ttgtgatttt ttgaaatatt gtgatttttt 1920

tgaatattgt gattttgttg aatatgtttg gttgaaggta tttagttttt ttttgtggtt 1980

tttttttttt gggtttgttt tttgtggttg ttgtttgttt tataggttag gaggtttttg 2040

ggtggattag tgtgattttt ttgaatattg tggtttattt gaatattgtg gtttttttga 2100

atattgtggt ttttttgaat attgtgattt tttagaatat tgtggttttt ttgaatattg 2160

tggttttttt gaatattgtg attttttaga atattgtggt ttatttgaat attgtggttt 2220

ttttgaatat tgtggttttt ttgaatattg tgatttttta gaatattgtg gtttttttga 2280

atattgttgt ttttttgaat gtgtttggtt gaaggtattt agtttttttt tggggttttt 2340

ttttgttgag tttgtttttt gtagttgttg ttagtattgt agtttagggg gagtgttaaa 2400

gaaggttgaa aaggaattgt aggagggtgg agggattaaa aggttataga gggtaaggta 2460

gggtggggat ttttggtgta gatttgtagt tttattg 2497

<210> 63
<211> 2497
<212> DNA

<213> Artificial Sequence
<220>

<223> chemically treated genomic DNA (Homo sapiens)
<400> 63

tagtggggtt gtgggtttgt attagggatt tttgttttat tttgtttttt gtagtttttt 60

ggttttttta tttttttgta attttttttt agtttttttt gatatttttt ttggattgtg 120

gtgttagtag tagttgtagg aagtggattt ggtggaaagg agttttggag gggaattgag 180

tgtttttagt taggtatgtt tggggagata gtgatgtttg gggagattgt gatgttttgg 240

gggattgtga tgtttgggga gattgtgatg tttggggagg ttgtgatgtt tgggtagatt 300

gtgatgtttt gggggattgt gatgtttggg gagattgtga tgtttgggga ggttgtgatg 360

ttttggggga ttgtgatgtt tggggagatt gtgatgtttg gggaggttgt gatgtttggg 420

tagattgtga tatttgggga gattgtgttg gtttatttag aagttttttg gtttatgggg 480

taagtagtag ttgtaggaag tggatttgga ggaaaggagt tgtggagggg aattgagtgt 540

ttttagttgg gtatgtttgg tgagattgtg atgtttgggg agattgtaat gttttggggg 600

attgtaatgt atggggagat ggtgatgttt gggggtattg tgatgtttgg ggggattgtg 660

atgtttgggg agattgtgtt atttgggggg atggtgatgt ttggggggga ttatgatgtt 720

tggagggatt gtgatgttgg ggtagaatgt gatgtttggg ggtattgtgt tggtgttatt 780

agaagttttt tggtttgtgg ggtaagtagt agttgtagga attagatttg gtggaaagga 840

gttgtggagg ggaattgagt gtttttagtt gggtatgttt ggtgagattg tgatgtttgg 900

tgagatggta atgtttgggg agattgtgat ttttggggag attgtgatgt ttggggagat 960

tgtgttgttt ggagagaggg tgatgtttgg agaggttgtg atgtttgggg agattatgat 1020

gtttggggag atagtattgg tttttttaga aattttttgg tttgaggggt aagtagtagt 1080

tgtaggaagt ggatttggtg gaaaggagtt attgagggga attgagtgtt tttagttggg 1140

tatgtttggg gagattatga tgtttgggga gattgtgatg tttggggaga tagtgatgtt 1200

tgggggtatt gtgatgtttg gggagattgt gatgtttgga gggattgtga tgtttgggga 1260

gattgtgatg tttgtttagg gagattgtgt tggttttttt agaagttttt ttttttttga 1320

aatttgttgt agtttataat tttttaggtt tggtttttgt tttgtttagg ttatattttt 1380

gaatttattg atttattttg tttgggatta gttagatagt aggtgtagaa gtgggaggta 1440

tagaaggaat gtggggttgg tggttgttgt tattgtagat ttggagtttt gtggatagga 1500

ggtgttgatg gttttttttg aggtttatat tatggttgtt tataaagttg tttagtttgg 1560

tttgttgtgt gtagggatat attgggtggg attttaggta tgtatttttt tggggatgtt 1620

ttattttgtt atgttttggg tttgttattt gatagtggtt tggttgatgt aatttttttt 1680

tttgatttgt ttttgtggtt tggtggtggt gtgtgtgggt tttgagtggg ttaggatgtt 1740

gttttttttg ttttttttat ggttttatta aataattttt taatatagta tttattatta 1800

ttttagtgta aatattgatg tataggaaga tgaagaaatt tgtataggaa tatttagtgg 1860

tggtggtgtt tttttgttat tgtttgtgtt agtgatattt gttttttttt tgtttgtagt 1920

tgtgaagttt ttgttttttt ttgtttaggg tgtagttgtt tttttatttt ggagtgaatt 1980

ttttttgttt ttttattaaa agtatgaggt ggtttttgag tatattttat atttttttat 2040

gatttttgga aaaaaaaaaa aagaaaagaa agaaagtttt agagggttat tgtgattaat 2100

tatgatgtga ttagttgtgt ttttatggag agtagaagtt gagggaagag tttatgtatg 2160

tgtggggttt attttgttat atattttttt gtgtttaata atatttaaat ttgtggttgt 2220

atttgtattt tggtttagtt tttggatggt ttttgtagtt atttttaaaa gggatgtttg 2280

agtgtatttt tttttgagtt attatgaaat tatttagatt tgtgggtaga agttagaggg 2340

aggtggaggg tttagtggtt tttggttgta agggagggtt tagtttgtag taaggattgg 2400

gtagagtttt tgagttttat tttgttttta gttatttagt ttatttttta atgaatgaga 2460

aaaataggta tagatttttt attttatata gttttta 2497



<210> 64 



<211> 3095 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 64 



ggttattata tttggtagtg gggtggtttt tagtttttgt tgtttttttg tttttttttt 60 

ttgtttgttt ttggagttta gttgattttt gaggttttaa ttttatttat tttttttttg 120 

ggttgttgtt gttgtgtttt tttttatttt tatttttttg aggagagtta taggttgtaa 180 

atttaattaa ttttgtaatt tatttttgta aaattattta taaagatttt ttttttgtgt 240 

ttgtgtttgt tttttttgtg ttgggttttt ttagttatgg ttataaagtg tttttttttt 300 

tttttgagtt ttgtatataa ggaatgtggg ttggggtttt gtttgttttt ttttttgttt 360 

aaggtaagga ttttgggaat ttgaagtttg gtgtttatta tgtttaggtt tgtagttttt 420 

tttttataga gtttgtatta tgggaaaaaa taaaataaaa tttaggaaag ggaggtaata 480 

gttattggga gttaatatag agttatgtag tgtttaaaat ataaatattg tagtggttag 540 

aaattttgtt attttttttg tttttttagg ttgttttgtt gaggtttttt gagttttttt 600 

gtatattgaa aggtattgta ggtgtagtgt gtattttttt tttatttatt ttaagaagtt 660 

ttgttttgtt attagttttt ttttttggga tgagtaggga gagtgtgtgg aggtttttga 720 

tttttttgat tataattaag aaagaataat ttttaaagtg tttaatattt ttgtttttaa 780 

gttttttaaa atataggggt agggaatatt aaaatatttg gtttttatta ggaagattat 840 

ggttttgaaa ggaaatagta gatatgatat tttattttat ttggatttat gattaaaaaa 900 

ataaaaataa aaatttaaag agtttgtttg tatttttttt ttttaaattt tggtttggtt 960 

tgaaggtagg gaatttaaaa gattgaggtt gatggaagag agttagtggg gtgagtgagt 1020 

gggtagtttt tttttttgtt ttttggagtt atttagaagg ataggggaag ggaaggaaga 1080 

agaggtgagg aaaaagagga gggagggaag tggaggttag gagtgatgga gtaaggaaag 1140 

tagtttgtaa gtgagaaaag agggaaaaaa tatagttgta tgaatttaga gagattataa 1200

gttgtatgta agtagtagta gaaagagtga gagtgtgagt gtgtgttttt tttgtggttt 1260 

ggggttagat agtttttaga ttagtttgaa ttatttttta agtattgttt tgtttttttt 1320 

gttttggttg ttttttaatt tttttttttt ttttttttta tttttttttt aaaaattaaa 1380 

ataatataag ggagggtggt aaaagttttt ttaaattggt tgatttattt aaagataata 1440 

ataataataa taaatatata ataatttata ttttatggtg ggagagatgt gggattaatt 1500 

tttggtattt attttaatat ttgatagtta gaataaataa atatatatat ttatattaat 1560 

agatatatat agaaaatttg gagttaaagt atttggtaag agtggaaaaa aaaagaatta 1620 

aaaggtaaaa taatgattat gagtagtggt ggtggtagtg gtattagtgg taatagtggt 1680 

ggtggtggta gtagtagtag tagtggtggt agtaatagta ataattattt ggtgtttggt 1740 

tttttttaga aattttttgt attattattt ttaagaattt tagttttaag aattaataga 1800 

gtttaatttt tggaatttga gttttggatt ttattattgt tatgtggtag gggaggattt 1860 

ggtgttagtt ttttgagatt tttattgttt ttggttaatt aaaagttttt aaagttataa 1920 

gattttttta ttggttggta tattttgagg tttttataag tagagtgttt tggatttgga 1980 

ggttttggtt tgaggtttga ggggtttgaa ggtggttttt tttttttggg tttaagatga 2040 

tggtatggtt tgttttgtta ttattatgtg ggtttttttt ttgtgatgtt ggtgtttttg 2100 

ttgtagtaaa gtttggtttt tggaattttg agaattaatt tgttatttgg tgatataaga 2160 

gggggagtgt gttttgtttt tttggggttt ggggttaatt tttttttttt tatttataaa 2220 

tttagtagat tgagttaaat gtataaaagg gagtgagagg tttgaattat tgggaaaagt 2280 

atgttatata tatagtaggg ttagagaggt gagtaagaga aaaataaaat aaaataaata 2340 

ttatagtttt ttttaattag aatattaggt attatgagaa aaatatttgt taagtagttt 2400 

ttggtgggtt tatttgtttt atttttattt aggatagggg tttttgttgt tgttttgggt 2460 

tttttttttt ttggtgtggt ggtttgggat ttttggtttt tgtattttga tggtttatgg 2520 

atttttgttt ttgatttttt gttttttgta agtttgtggt gttatgtaaa ttataggatt 2580 

ggtattggtt ggattttttt gtatgtgttt tttttttttt tatttaattt tttaagtgtt 2640 

ttaaagatgt attattttaa tattaatatt attgaaagaa gtttaaattt ttggttatat 2700 

gtaataattt tagtttttat tttttttttt tttttttttt ttggtgtaat tttttttttt 2760 

ttttttggat ttttgttgaa gtgtgttttt tttgtatttt agagaaatgt ttaaaggatt 2820 

tgttttggtt tggtttgttt ttttttagga tagtaagtgg tgggtttaat ttgttattgt 2880 

tgatttttgg gaaatttttt gttgtaagaa atgtgtgtgt gggggggagg gtgggggtgg 2940 

tggggtggta tgtgtgtgtt ttttataaaa ttttgtgagt taaatatttg tttgtgtttt 3000 

gttttttttt aaggttttga gatttttgtt tttgaggttt gttttaaggt tgttgtaaaa 3060 

aaattttttt agtttgtgtt taagagatta gttgg 3095 

<210> 65 
<211> 3095 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 65 



ttggttgatt ttttaaatat agattgaaga gattttttta taatgatttt gaaatgagtt 60

ttgaaaataa aaattttaag attttaagag aaaataaaat ataaataggt atttggttta 120

tagaattttg tagaaaatat atatatatta ttttgttatt tttatttttt tttttatata 180

tatgtttttt gtaataagaa attttttaag agttaataat aatagattaa atttattatt 240

tgttgttttg gaaagaaata aattaaatta aaataaattt tttgaatatt tttttgaagt 300

gtaggagaga tatattttag taaaagttta agggggaaaa agaaaattgt attaaaggaa 360

aaaaaaaaaa aaaaagtggg ggttgggatt gttatatatg gttaaaaatt taagtttttt 420

ttaatagtat tagtattgaa ataatatatt tttaaaatgt ttgagggatt agatagggaa 480

agaaaaggta tgtataaaaa aatttaattg atgttgattt tgtgatttat gtaatattat 540

aaatttgtaa aaggtaaaaa attagaagta aaaatttata aattattaaa atatagaaat 600

taaaaatttt aagttattat attagaaaga aaaaaattta gaataatagt aaaaattttt 660

gttttaaata aaaataaagt aaatgaattt attgaaaatt gtttggtaaa tatttttttt 720

gtggtgttta atattttagt tggaaagagt tgtgatgttt attttatttt attttttttt 780

tatttgtttt tttaatttta ttatatatat aatatatttt ttttagtggt ttaaattttt 840

tgtttttttt tgtgtattta gtttgatttg ttgagtttat gggtaagaaa gaaggaatta 900

gttttagatt ttgggaaagt aaagtgtatt ttttttttta tgttattgaa tagtaaatta 960

gtttttagaa ttttagaggt tgagttttgt tatagtgaag gtgttgatgt tatagaggag 1020

gagtttatgt gatggtggtg gagtaggtta tattattgtt ttgggtttgg ggagggagag 1080

ttatttttag gttttttgag ttttgaattg gaatttttaa atttgagatg ttttgtttat 1140

gaggattttg aaatatgttg gttagtgaaa aaattttgtg gttttgaggg tttttggttg 1200

gttaggggta gtaaaaattt tggagagttg atattaagtt tttttttgtt atgtagtagt 1260

ggtaaagttt gaagtttaaa ttttgagaat tgagttttgt tgatttttag aattggggtt 1320

tttagaagtg gtgatgtaag aagtttttag gaaaggttgg atattaggtg attattgttg 1380

ttgttgttgt tgttgttgtt gttattgttg ttgttgttgt tgttgttgtt ggtgttgttg 1440

ttgttgttgt tgtttatgat tattatttta ttttttaatt tttttttttt ttgtttttgt 1500

taaatgtttt ggttttaagt tttttatgtg tatttattga tataaatgta tatatttatt 1560

tattttagtt gttaggtgtt aaaataaatg ttgaagatta gttttatgtt ttttttatta 1620

taggatatag attgttatgt atttattatt attattgttg tttttgagtg aattggttgg 1680

tttggggagg tttttgttat ttttttttgt gttgttttgg tttttggaaa ggaggtggag 1740

gagaggaagg aggggaatta gggggtggtt ggagtagaga ggatgagata gtgtttgggg 1800

ggtgatttgg gttagtttgg gggttgtttg gttttagatt gtggagagga tgtgtgtttg 1860

tgtttttgtt ttttttgttg ttgtttgtgt atggtttgtg atttttttgg atttgtgtgg 1920

ttgtgttttt tttttttttt tttgtttgta aattgttttt tttgttttgt tgtttttggt 1980

ttttgttttt tttttttttt tttttttttg tttttttttt tttttttttt tttgtttttt 2040

tgggtaattt tgggaggtaa aaagggaggt tgtttgtttg tttgttttgt tggttttttt 2100

ttattggttt tggtttttta gattttttgt ttttgagttg aattgagatt tggaaggaaa 2160

aaatgtaagt gaattttttg ggtttttgtt tttgtttttt tggttataaa tttagatgag 2220

atgaagtatt gtgtttatta ttttttttta gagttgtgat ttttttaatg agagttgagt 2280

gttttggtgt tttttgtttt tgtgttttgg ggagtttggg ggtggggatg ttgaatattt 2340

tgaaaattat ttttttttgg ttgtagttga gggagttggg aatttttgtg tgtttttttt 2400

gtttattttg aggagagaga ttgatggtgg gatagggttt tttggggtgg gtgggaaagg 2460

ggtgtgtatt gtatttgtga tgttttttag tgtgtggggg gatttaggga attttgatag 2520

gatagtttgg gagaatgaga aaggtggtgg gatttttggt tgttgtggtg tttgtatttt 2580

gggtgttgtg tgattttgtg ttggttttta atggttgttg tttttttttt ttaaatttta 2640

ttttattttt ttttatggtg taagttttgt aaaaagggaa ttgtgggttt gagtgtagtg 2700

gatgttaggt tttagatttt tgaggttttt attttgggtg aggagaaaga tgaatagagt 2760

tttagtttgt gttttttatg tgtaagattt aggaggagag aagggtattt tgtggttgtg 2820

gttgagggga tttggtgtgg gaggagtggg tgtgggtgtg aaagggagat ttttgtgagt 2880

gattttgtaa aaatagattg tgaggttggt tggatttgta atttgtggtt ttttttgagg 2940

gagtaagaat gggggaaggt gtggtggtgg tggtttgggg agggagtggg tagagttgga 3000

gttttagaaa ttggttgagt tttgggggtg ggtggggaga aagggtgggg gggtagtagg 3060

agttaggggt tattttgttg ttggatgtag tgatt 3095



<210> 66 

<211> 2229 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 66 



tttttttttg gtgttggttg gtgtgggttg gggttaggtg gagaagttgt tttttgttaa 60

ggtgatagaa tgtgttgggg gtgggggttg gggttagggt tggtgtaatt agggggttgt 120

tgtttttttt tggatatagt ggaagttttt tttgtattat taaatttttg ttattttttt 180

tgagggattt gtttttaggt agtatgtaag ttgttgtttt gggtttattt tgtatttttt 240

tattgggtga ggaaggagta ttttgaatgg agatgggggt gtttttggtt tatatatttg 300

tagagaagag gtgtgttggg ttgtattttt ggaggttgtg gtaattgata ttagagaaga 360

ttttggttgt agttgggaag gtttattggt tggaaagagg tgtttttttt ttttagtaaa 420

gggttttgtt tggaagggtt gttttttatt tgtttagtgg tattatagga tggttggttt 480

ttatttgaat ttttttggat ggtattatta tatagttggg tttttgtagt gttggttttt 540

taatttgatg attgttattt tggtgaggat ttgtgttgat ggttggagaa ttttgtgttg 600

tgggtgtata tggttaggtg gtgtttggta ggtgatgttt gggtgtagga tggtgttttt 660

attgttttat tttaaattgt tgtttgggtt taggtttttt ggttttttga ataggggttt 720

ggggggttaa ggatgttgag gttttggggg taggaagttt tttttggtta agtgtttttt 780

tttttttttg gtatatattt ttttatttat ttattttgtt tatttttggg gtgagaggtt 840

tattaaggta gggtgtgttt tttttatgaa ttattttaag gtttttgagt tgtgggggtt 900

ttgggtaatt attttttttt ttttttggtt ttaggtattt tagtttaggg gtttgtagag 960

aagtttgaag tttggataaa tgtgttggat gttaataatt ttttattttt ggtagtagta 1020

aaggttaata tatttttatt ttttatttta gtttgttatt aaaataaagt tgtgtgtggt 1080

tgagggtagg aaggtgttga gattgagaag aagggatgtt ttggagaaag tgtgtttagt 1140

tgattttaga aattagagtt ttttgggatt ttgttgagat tttttgtagg gtgttttaat 1200

ttgttttttt attgtgtgtt ggtgttgtag tgtgtgtggt ttagggtttg gtgattttgg 1260

tttagtttgg tggttgtggt gaggtttttg gtgtagttgt ttggaatttt gtattagaat 1320

tgggattgtg taaatgtttt ggttgaagtg ttattttatt taagaaatat tgttgttagg 1380

aataaaatgg ggtttttggt gttttgaagt attttttgaa atttttttaa aataatttat 1440

aaaaaatgtt tttgttttaa tgttttataa tgtttaagga aatatgtaaa tggtttgttt 1500

ttttattgag atggttgttt taattaatag tgtatatata tataataatt tttttaattt 1560

ttttttttag agttaagtat tttattatat gtaaattata ataaagaaaa gattgtgtaa 1620

gattatgtaa gttgattgat ttaaaatatt gagttttaat ttaggttttt tgttttttta 1680

tttaataatt tttgtgtttg gattagattg gtgaagtagg ttatggaaat taataaagta 1740

aaaaattaaa agtatttttt tttgttattt ttttttttaa aattaaataa tagttgtttt 1800

tttttgagta ggttttagtt ttaggtttga gtttttttgt gattatttta tagttattta 1860

tagtagttgt tgttgttttt gttgggtttt tgtttttgtt ttttttgggt tgttttttgt 1920

atataaaata tattttagtt ttttaattaa atttaaatat gattttggta gaatttatat 1980

attttgtggt gtatggattg tgttggtgta ggggaaataa atattttttg gtatttaatt 2040

attgagttta atttgaaaaa ttgggattgg gtttttaggt ggtattttag gggttttaat 2100

ttggtttgtg tttttttaga ttttggtgtt gagagtgttg tttttgtggg tgggtggatg 2160

gagaggtaat aatttgtttt taataaaaat ttgttgttat tgaattgaaa gtgaaaggga 2220

agggagaag 2229

<210> 67
<211> 2229
<212> DNA

<213> Artificial Sequence
<220>

<223> chemically treated genomic DNA (Homo sapiens)
<400> 67

tttttttttt tttttttgtt tttgatttgg tggtgatagg tttttgttga aagtagattg 60

ttattttttt gtttatttat ttgtaaaagt agtgttttta gtgttaaggt ttggggaggt 120

gtgggttagg ttggagtttt tggggtgttg tttaggggtt tagttttgat tttttgaatt 180

agatttagtg gttaaatatt agagggtatt tatttttttt gtattgatat aatttatgta 240

ttatgaaatg tgtaaatttt gttggggttg tatttgaatt tagttagaga attggggtgt 300

gttttgtata taagagatga tttaaagaag gtagaaatga aaatttgata gaagtagtaa 360

tagttgttgt gggtgattgt ggggtgattg taggaaaatt tgagtttggg attgaagttt 420

gtttaggaag gggtgattgt tgtttaattt tggagggagg gatggtgaag gaagatgttt 480

ttaatttttt attttgttaa tttttatagt ttgttttatt agtttggttt aaatataaaa 540

gttgttaaat agaaaaatag agggtttgga ttaaaattta atattttaag ttaattgatt 600

tgtatgattt tgtataattt tttttttatt ataatttata tatagtgaag tgtttagttt 660

tgaggaggaa agttggaaga attgttatgt atgtgtatat tgttagttag gatgattatt 720

ttgataaaga aatagattat ttatatgttt ttttaaatgt tgtaaaatgt taaagtaaaa 780

atattttttg taagttgttt taagaaaatt ttagaagata ttttggagta ttggggattt 840

tattttgttt ttgatagtag tgttttttga atagggtgat attttagtta gggtatttgt 900

gtggttttga ttttaatgtg aagttttaag tggttgtgtt aggaattttg ttgtgattgt 960

tgggttaagt tggagttatt aagttttgag ttgtatgtgt tgtgatgttg gtatgtagta 1020

ggaaaataga ttaaaatgtt ttatagaaaa ttttggtgaa gttttggagg attttggttt 1080

ttaagattag ttgggtgtat tttttttggg atgttttttt tttttggttt tagtgttttt 1140

ttgtttttag ttgtgtgtag ttttgttttg gtggtaaatt gaaataagaa atggaaatat 1200

attggttttt gttgttgtta gggatgagag gttgttgatg tttggtgtgt ttgtttgggt 1260

tttgggtttt tttgtagatt tttggattgg ggtgtttgag gttaggagag gagggggata 1320

gttgtttgga gtttttgtgg tttagaggtt ttgggatgat ttatgggggg ggtgtgtttt 1380

gttttggtga gttttttgtt ttgagggtag gtgaggtggg tgggtagggg agtgtatgtt 1440

ggagagaaga gagaatgttt aattagagag aattttttgt ttttggagtt ttagtgtttt 1500

tagtttttta aatttttgtt taggaagttg aaggatttag gtttaggtaa tggtttgggg 1560

tggggtggta agagtgttgt tttgtatttg gatgttgttt gttaggtgtt atttggttat 1620

gtgtgtttgt agtgtagggt tttttggtta ttagtatagg tttttattga ggtgatagtt 1680

attggattgg gaaattaata ttgtgaggat ttggttatgt gatgatattg tttgggggaa 1740



tttgagtgga agttgattgt tttgtggtgt tattagatag gtgagaagta gtttttttaa 1800 

atagggtttt ttgttggaag gaggaggtat ttttttttag ttagtgagtt tttttagttg 1860 

taattggggt tttttttaat attagttatt gtggttttta gaggtgtagt ttggtatatt 1920 

ttttttttgt agatgtataa attggggata tttttatttt tatttaagat gttttttttt 1980 

tatttagtag aggggtgtgg agtaaatttg ggataataat ttgtgtgttg tttggaagta 2040 

ggttttttag aaaggatgat aaaaatttgg tgatgtggaa gaagttttta ttgtgtttag 2100 

gaaagggtag tggtttttta gttgtattgg ttttggtttt ggtttttatt tttagtatgt 2160 

tttgttattt taataaagag tggttttttt atttgatttt aatttgtatt agttagtgtt 2220 

gaggaaaga 2229 

<210> 68 
<211> 2820 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 68 

tttaggggat aatatttagg tatgttaatg gagtttaaaa tgttaaggaa attatattat 60 

aattttgttt agtatattat aggttgttaa attgaaatgt tatgttagtt aggagtgtag 120 

taatttttat tttttggttt tatttaatta ggaagtttta gtagagtgaa gtttgttaag 180 

tgtttgttgt tagaatttga aggaatttga gtgagtaaga agagtgtttg atttatttta 240 

tagaagtttg tttagaaatg gaggagttag tgtttattga agttggtttt gtttttggtt 300

tgtttatatg gagtttgatt agttttagtt atgtttattt tggtttggga gatttgtaaa 360 

gtgttttttt ttttaatttt tttgtattat tttgaagttt agggaagtaa agagaggggt 420 

atatttggat tgtaaaatta atgttttttg ttgtttagga gagaagggaa tgagagagag 480 

agagagatag atagatagag agagagagag agagagagag agagagagag agagagagag 540 

agaaatttta ttgaaattta gtttttttag aatttgtgtg atttggtttt taatgggaga 600 

ttagtgtgat tttatggtat ttttgttagg aattagtgat ttttttgtag ttattatttg 660 

atttattgtt tttttgttta ttttttttta taaagttatt ttttttttat tttagtaaga 720 

tttttttttt taatgatgat aaagtttttg ttttagtgtt ttttttagga ttggtgtttt 780 

tttaaaatag tgaatttaga aaattatttt gtttaatatt ttttaaaatt tttgtagttt 840 

taatgtaagt gtaagtatgt aaaggttttt tgttatattt gtattttttg tttattttag 900 

aattattttt tatttttggg tttgtaatag ttttttttgt ttttttggat agaggtgggt 960 

ggtattaggg gtttagggta gtaggaggtg aggggttgag gaggttgtta gggtaggttg 1020 

gtttgtgttg gatatgtgtg tttttttgtg gagttaaagg gttggggatg ggggttttgg 1080 

atttattaga gtaattttag ttggtgggtg tttggtagtt atttaaggag gtagggaaag 1140 

tagtgagttt tattgggtgg gttatgatga gtagtatgat gggtagtagt agtagttagt 1200 

aaaagttttt gtaaagtgtt tagttgttgt attgttgtgg ggatttttat agtattatga 1260 

ttagtttgtg taattttgta gtagtaaatg gtttttgagg aatataggat tgtgggggtt 1320 

gggtagtggg ttattgagta ttttgtggat ggtggtagta gaggtggtgg tggtggtagt 1380 

ggtatttggt ggggaagtag tagttaaatt tgtgtatgat tttgagagtt ttagtaatat 1440 

ttagggattg ggtttagttt tggagtgaga gggttgtttg ttgagaagtt gtgttggaga 1500 

tgtgggaagt tgttgttata aggagggagt tttgggaagt tggaggatag gaggagatgg 1560 

gagtttaggg gtagatgagt ggagtttgag gaggtagggt ggagggagag ttaaggtgtt 1620 

ttgtagtttg gtagttgttt tttgagtttt gttgtttgta tttttttggt gtttgggaag 1680 

tagtaggttt ttagtttgtt tggggttatg tgggaagagg tagttgggtt ttgattggtg 1740 

gagtaggatg taggttttgg gagggagggg ttgatgagga ggtgtaagga tgtaaggagg 1800 

aggtggttgt ggaagttata gatgggtttg tttgttaggt gttggtttga gtggggttag 1860 

gtggggatgg tttaaatgag aagtttgggt tttagggtgg gttatttgta tatttatata 1920 

ttatttgttt tattttttgt tttaggatgt tttttattga aggtggggtt tggattagtg 1980 

tttttttttt gtgtgtgatt ttgggttgtg agtgtgggtt gtggttgggt ggtgtttttt 2040 

tgagttggag atggtggggg tggaggtgtt agaggagtag tagtagtagg gtagagaggg 2100 

gtgagttggt gtgggagagg gtgttttgtt ggtgattggt gttttagtgt gtgggagtgt 2160 

gttgtttagg ttgtaggggg atgtaggttg ggaatgttgt ggtggagagg ttagggatgt 2220 

ttttttaggg atttatagga aagagggtga gaggtgatgg tgttagaatt gtttttgttg 2280 

atttggaagt aatagtagta ttttttataa gagtgtgtaa ttttaaggtt gtttgttgag 2340 

gtagtttagt tattttggta ggtgtttttt tttttttttt tttttttttt tttttttttt 2400 

aggttttttg tagttttgat ttagtttaag tgtttgtagg tttgaatttt tttttttatt 2460 

atttgttttt ttttagtttg tagtttatta gtgtgtttat ttgggaggtg tggttagatg 2520 

tgtttggaag gttagattgg ttgggataag tggtttgaga gaaagagaaa ggtttttttg 2580

tatatgttgt gggtgggttg ttgggagtat tggttgggta gtggtgtttg ggaaggggag 2640 

agtgggtttt atttgttggt ttaggtagtg attttgtgtt ttttatttgg gtttttgttg 2700 

gatggttggt gatttggggt gatgagagaa ggtttaattt ggtaggagtt tttggttttg 2760 

tgtgtttttt ttattttttt tagtgggaag ggtaaatggt atagtgggat ttgttttttg 2820 

<210> 69 
<211> 2820 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 69 

tggaaggtgg gttttgttat gttgtttgtt ttttttgttg gagagaatga aagaaatgtg 60 

tagagttaga gatttttgtt gagttagatt tttttttgtt gttttaggtt attggttatt 120 

tggtaaagat ttgagtaagg aatgtagggt tattgtttgg gttaataaat ggagtttgtt 180 

tttttttttt tggatgttgt tgtttggttg atgtttttgg taatttattt gtggtgtatg 240 

tagaggagtt tttttttttt ttttagatta tttgttttga ttaatttgat tttttaaata 300 

tatttgattg tattttttag gtggatatat taataggtta tgggttggag aggagtgggt 360 

gatgaggaga gggatttaaa tttgtgaatg tttgggttgg gttggagttg tggggggttt 420 

gggaggagag aggggagaag agagaaggaa ggagagtgtt tgttgggatg gttgagttgt 480 

tttggtgagt agttttgggg ttgtatgttt ttgtgggaga tgttgttgtt gtttttaggt 540 

tggtaagagt ggttttaata ttattgtttt ttattttttt ttttgtaaat ttttagagaa 600 

atgtttttgg tttttttgtt gtgatatttt tagtttgtat ttttttatag tttaggtggt 660 

gtgtttttgt atgttggagt gttggttgtt agtaggatgt ttttttttgt gttgatttgt 720 

ttttttttgt tttgttgttg ttgttttttt gatatttttg tttttattat ttttagtttg 780 

gagagatgtt atttagttgt ggtttgtatt tgtggtttgg ggttatgtgt ggaagagggg 840 

tgttagtttg gattttgttt ttggtagggg gtgttttgga gtggagagtg aggtgaatgg 900 

tatatgagtg tgtgggtagt ttattttgaa gtttgagttt tttatttgag ttatttttgt 960 

ttagttttat ttgggttagt gtttggtgag tgagtttatt tgtggttttt gtggttgttt 1020 

tttttttgta tttttgtatt tttttgttga tttttttttt ttgggatttg tattttgttt 1080 

tattaattag agtttgattg ttttttttta tgtgattttg ggtgggttga ggatttgttg 1140 

ttttttaaat gttagaggga tgtgggtggt agagtttgag aggtggttgt tgggttgtgg 1200 

ggtgttttga tttttttttt attttgtttt tttgggtttt atttgtttgt ttttggattt 1260 

ttgttttttt ttgttttttg gttttttaga gttttttttt tatggtagta gttttttgtg 1320 

tttttggtgt agttttttag tggatgattt ttttgttttg gggttgagtt tagtttttgg 1380 

atgttgttga aatttttgag attatgtgtg ggtttggttg ttgttttttt gttgggtgtt 1440 

attgttattg ttgttgtttt tgttgttgtt gtttgtggga tgtttagtag tttgttgttt 1500 

ggtttttgtg attttgtgtt ttttggaagt tgtttgttgt tgtagagttg tatgaattag 1560 

ttatggtgtt gtgggagttt ttgtggtagt gtagtagttg gatattttgt gagggttttt 1620 

gttggttgtt gttgttgttt gttatgttat ttattgtagt ttgtttggtg aagtttgttg 1680 

ttttttttat ttttttaagt gattgttaaa tgtttattgg ttggaattgt tttggtaagt 1740 

ttagaatttt tgtttttgat tttttaattt tgtagaagaa tatgtgtatt tagtatagat 1800 

tagtttattt tagtggtttt tttagttttt tattttttat tgttttagat ttttaatatt 1860 

atttattttt atttagagaa ataaggggaa ttgttgtagg tttgggggtg aggggtggtt 1920 

ttgggatggg tagaaagtgt aggtgtagta ggaaattttt gtatgtttgt gtttatattg 1980 

gagttgtgag gattttgaga aatattaaat gggatggttt tttgggttta ttgttttgaa 2040 

agagtattaa ttttagggga aatattgaaa tagaagtttt gttattatta aagaaaaaag 2100 

ttttattagg atgaggaaga aataatttta tgagaaagaa tgagtgagaa agtaataaat 2160 

taaatggtga ttgtagggga attgttgatt tttggtaaag gtgttatgag gttgtattgg 2220 

ttttttgttg aagattaggt tatatagatt ttagaggagt tgggttttaa tagaattttt 2280 

tttttttttt tttttttttt tttttttttt tttttttttt ttttatttat ttattttttt 2340 

ttttttttta tttttttttt ttttaggtgg taaaagatat tggttttgta gtttagatat 2400 

gttttttttt ttgttttttt aagttttaag gtagtatagg ggagttgaga aaaagaatat 2460 

tttgtgggtt ttttaggttg gagtgggtat gattgaggtt ggttaggttt tatgtaggtg 2520 

agttgagggt ggaattgatt ttagtgggtg ttgatttttt tatttttgga taggtttttg 2580 

tggagtgggt taggtatttt ttttgtttgt ttgggttttt ttagattttg atggtgaatg 2640 

tttggtaggt tttgttttgt tgaagttttt taattaaata gggttagagg atgggagttg 2700 

ttgtattttt agttggtata gtattttggt ttgatagttt gtagtatatt gggtagaatt 2760 

gtggtgtaat ttttttggta ttttaaattt tgttaatatg tttgggtatt gttttttagg 2820 

<210> 70 
<211> 2280 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 70 

attagtatag aattgttatt ataggattgt agtttaagta ttagagtgat attaattggt 60 

tatgtttaat tgttttaaaa ttatttttta aataattata ttgatattat aatagaaatt 120 

atgggtattt attttatatt tagatggaag gtattattgg atatgtatta aaaaaaagat 180 

tttttgaaaa aaataaaata aaataaaata ttattattaa aataaaagaa tttaaaataa 240 

tttttaaaaa ttttttttaa taaaatatat tgttaattta taaaatggat tgatgattag 300 

ttatgtaaat gttttaaata aaatttttat atttttttta tagttaattt atgttttgaa 360 



ttgtttattg attataaata atggtgaaat ggtatttttt gattatattg tattttgttt 420

atagaaagtt tgatatgatg gaatttatta ggtaagaggg tgggtgttgt gaatgagatg 480

ttgtttttag ttgtgggggt gggtagagtt tttggagtgt gtggatttta tgtgagttat 540

gtagtatttt ttgttttttg ttagaagtta aagttattta tttataatat atttatgttt 600

ttgtgtaatt gtttattttt gtgtagttaa tagggagtta ttatggggtt aatttatagg 660

ggaaaaatag atatttattt ttttatatag atgtggatat atgtatatat gtatagaatt 720

gtggtattta attttatagg ttttggggtt gagggtgagg gtattgttag tttttttagt 780

agggttgtgt tgggtttttt gttaaaaggg gtttttagta aagagtgagt tggttgtgtt 840

tttttagttt tttatagttt gttttttgtt ttaaggaggg agttaagtaa ggggttaggt 900

ttttttggtt gtgtgagttt atagttattt atttatttat gtttatttag gttgatggtg 960

tggggatttg ggtatatgtt tttttagttt ttggggtgtt tgtggtgggg aaggatgtat 1020

tgtttttttt ttgttttttt ttattggggt tggggtttta gtttgagagt gagtgagagt 1080

tatagttata tattttgtgg gttttatttg tgttgtaagg tttattgtgt tagtaggaag 1140

agttatagat tgtttgtagg gaattaattt tggatgtgga ggagttggta ttttgttttt 1200

tggattgttt ttggtttttt aggataaata tgagtatgtt tattatggtg aaggtggagg 1260

tgataaatat tagtagtagt tttgggatta atattgagat ggatattttg ttggtgttta 1320

ggtaggagtt ggagtgtgtt ttggtttttg ttaatttagt gttgttttta ttgtgtgaag 1380

ttaatgtggg tgagatttta gtgtatagtt gagggtagat tttgttattg gagaggagta 1440

tgaaattttt tttaaagaag tttattggtg ttttatattt gaggttgttt attagtattt 1500

tggaatttaa gtgttttgtt tattgtttga aaggtataat tgtgtaggag tatttttagg 1560

ggtttttgtg gaggtttatt tggatgatgg aggttaattg gtttagtatt tttgttattg 1620

ggaggtatat gaagtaattg ttgtgtaggt tgagtttaga gagtgagatt ttagtgaata 1680

tgtttatagg tagggatttt agtaggttgt tgttgagaat gaggattttt agtttgggta 1740

tggtattgaa agtgtttggg aggatgagtt ggatagtgtt gtattttatg tttaggtatt 1800

ttaggttttg tagttttgtg aatttttttt gggatagtgt gtttaggtaa ttgttattta 1860

tgtatagtta tttgaggttt aaaaggtttt tgaaagtgtt gttttttata gtagtgatgt 1920

tattgttgtt tagatttaat agaatgaggt ttttgtaatt tataaagtgt gatttttgga 1980

tgttgtggat tttgttattt tgtaggaaaa gtttttgtat gttagagagt ttgggtttta 2040

aattagttaa gttgtttatg tttttgttgt tgtagtttat ttttaaattt gattttggga 2100

tgtggttgta gttgtagttt ttagggtagg gtaaattgtt agttaagggt ttgtttttgg 2160

agttatttgt tgttattgtt gttgtgggtt tgattttgat ttgttagttg tttgggattt 2220

ttgtattttt gtttggagta gattttggtg tggtatgatt tttttgttta tttgttttga 2280

<210> 71
<211> 2280
<212> DNA

<213> Artificial Sequence
<220>

<223> chemically treated genomic DNA (Homo sapiens)
<400> 71



ttaagataaa tgggtaagag gattatgtta tattagggtt tgttttaaat ggaggtataa 60

agattttagg taattggtag attaaaatta gatttatagt agtgatagtg atgggtagtt 120

ttaggaataa atttttagtt aatagtttat tttgttttgg gggttgtagt tgtgattata 180

ttttagggtt gggtttaaag atgaattgta ataataggaa tgtgagtagt ttggttgatt 240

tgaagtttaa gttttttaat gtgtaggagt tttttttatg agataataag atttatagta 300

tttgaaaatt gtattttgtg gattataaga attttatttt gttggatttg ggtaataata 360

atattgttat tgtagagaat aatattttta agaatttttt ggattttagg tggttatata 420

tggatagtaa ttatttggat atgttgtttt gggagaaatt tgtggggttg taaaatttag 480

agtatttgaa tgtggagtat aatgttattt agtttatttt tttgggtatt tttaatgtta 540

tgtttaaatt gaggattttt atttttaata ataatttgtt gaggtttttg tttgtggatg 600

tgtttgttgg ggttttgttt tttaaattta gtttgtataa taattatttt atgtattttt 660

tggtggtagg ggtgttggat tagttaattt ttattattta gatagatttt tatggaaatt 720

tttgggagtg tttttgtata attgtgtttt ttaagtagtg ggtagaatgt ttgggttttg 780

aagtgttgat gagtgatttt aagtgtgaga tgttggtgaa tttttttaga aaggatttta 840

tgtttttttt taatgatgag atttgttttt agttgtatgt taggattttg tttatgttaa 900

ttttgtatag taaaaatagt attgggttgg tggagattgg gatgtatttt aatttttatt 960

tagatattag tagggtgttt attttggtgt tggttttggg attgttgttg gtgtttgtta 1020

tttttgtttt tattgtggtg ggtatgtttg tgtttatttt gaggaattga aagtggttta 1080

agagatgaga tgttaatttt tttgtgtttg agattaattt tttatagata gtttgtgatt 1140

ttttttattg gtataatggg ttttataatg tagatggggt ttatagagtg tatgattgtg 1200

gtttttattt gtttttagat taagatttta attttaatag gggagggtag agggaaggtg 1260

atatattttt ttttattgta ggtattttgg gggttggagg ggtgtgtatt taaatttttg 1320

tgttattagt ttggatgggt ataagtagat aaataattgt gagtttgtat aattgaaagg 1380

gtttgatttt ttatttagtt ttttttttga aataaagagt agattgtgga gagttgggag 1440

agtgtagtta gtttgttttt tgttgagagt tttttttgat agaaagttta gtatgatttt 1500

gttggaagaa ttgatagtgt ttttgttttt ggttttgggg tttgtggggt tggatgttgt 1560

ggttttatat atatatatat atatttatat ttatatagag agatagatat ttattttttt 1620

tttgtggatt agttttgtga tggttttttg ttggttatgt agggatgggt agttgtatga 1680



aggtatgaat gtattgtaaa taagtaattt tgatttttga taaaaaataa aaagtgttgt 1740 

atggtttgta tggaatttat gtgttttagg gattttgttt gtttttgtga ttggagatgg 1800 

tattttgttt atagtattta tttttttatt tgataagttt tattgtatta aattttttat 1860 

aaataaaata tagtataatt agaaagtgtt attttgttat tatttgtgat tggtaggtag 1920 

tttagagtat aagttaattg tgaaaaaaat gtaaaggttt tatttaggat atttgtatgg 1980 

ttagttatta gtttatttta tgagttaata atgtattttg ttgagggaag tttttagggg 2040 

ttgttttggg tttttttatt ttgatggtga tgttttattt tattttattt tttttagggg 2100 

gttttttttt taatatatat ttaataatgt tttttatttg aatgtaaaat aagtatttat 2160 

gatttttatt atagtattag tgtaattatt taaaaaatga ttttgaggta gttaagtatg 2220 

attaattaat gttattttag tgtttaggtt gtgattttat ggtagtaatt ttgtgttggt 2280 

<210> 72 
<211> 2470 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 72 

ttttgttttt atgtttagtt tattgaagtt tgtttttagg ttattggagt tgagtttttt 60 

gaatattttt ggtttagagt gttgtagatt tatgaattgt tgtttgttga aataaggtgt 120 

aaagtttatt ttgtttaaag tttttaatag gagtgataat ttaggtaaaa tgtttatata 180 

atagtgggaa attaaaagtt tgtgatatag aaaagtgata tatatttgta aattttttta 240 

ataaaggtat tgtagtataa gtatgattgt agagttagat tgttaatgtt taaatttttt 300 

ttgggtttta atttttttat ttgaaaattg ttaatagtgg aatttagttg atagggttgt 360 

tgtggagtaa gtgtttttaa tttggagttt atgaaatttt agtatttgtg aattttgtat 420 

tgaaaaaatt gtaattttat gtttattaat tttataatag ttttgttttt ttgaattata 480 

aaggtaggta ataaattata gtagtattat tagtatttat aattttgttt tttatagaaa 540 

ttatttttat atttagtaag ttgttgtaga tattttgaat aattatatgt atatattttt 600 

taagtattat atatataata tttttatatt ttgtatatat ttttggagat agggttttgt 660 

tttgtttttt aggttggagt gttgtagtat gattggaatt tattgtaatt ttaaattttt 720 

gggttttagt gattttttta gttttagttt tggaatagtt aggattataa gtatttttta 780

ttatatttag ttgttaattt tttttttttt tgtaaagata aggttttatt attttgttta 840 

ggttgttttt taatttttgg atataagttt tttgtatagg ttttttaaag tattgggttt 900 

ataggtgtga gttattgtgt ttggttaata ttatgtattt aattttatgt gtttagaaat 960 

atgtgttatt ttaagaaggt gtttataggt tttattatat tattaaaggt gtttatggta 1020 

taagtgaaga aatttttaga tattagagta ataaaaatta tgatgtatta gaatggatta 1080 

ggtttatatt aaatgtttaa tatatagttt gttgttatgg tttttttttt tttttttttt 1140 

ttttttttga gatagagttt tattttgttt tttaggttgg agtgtagtgg agtgattttg 1200 

gtttattgta atttttgttt tttgggttta tgttattttt ttgttttagt tttttaagta 1260 

gaggaattgt aaggtgtttt attatgtttg gttatttttt gtatttttag tagaaatggg 1320 

gtttttttat gttatggtag gttggttttg aatttttgat tttaggtgag ttatttgttt 1380 

tagtttttta aagtgttggg attataggta tgagttatta tgtttagttt gttatggttt 1440 

ttaaaatatt tatagaatag aaattgaagt aagtttattt gtttttatat taaaagtagt 1500 

tagtatggtg ttttgtattt atgtatttta tggatgttga atgagttatt gaatgaagtg 1560 

gagtttaggt tgtgttttag gttttttagt ggggttggaa ggaatagttg tgaagttgtg 1620 

gggtggtttt tgtgtattgg gggattttgg agttagatgg ttttttaggg ttttgaagga 1680 

gaagagttgt gtttttaata atgattttaa tgggaataga gattgttggg gattgtaggg 1740 

gttaggttgt tattttttaa agtttttttt ttattattta agaaggtatt taggtttttt 1800 

tgtgttaggt tttaggtaaa gtgttggatt atttagtaat tgggtttagt agtaggatgg 1860 

ttttagattg aggttttagg gttaaaggat tatttttttt tttagtgttg gtttgggaaa 1920 

ggtaagtttt gggtgggagt gtatgttgtg tttttgaagt ttggtttttt tgttatgttt 1980 

attttttgtt tttattttgt gtttttttag gttttttttt ggtgaattgg atgttttgtt 2040 

agtttttttt tttgtgtttt tggttgtggt ttgggttttt tgtaaagttg ttgttatttt 2100 

ggagggttta gttagtgggt ttttggaggt tggttgggta ggtgtggtgt gtggtaggag 2160 

ttgggtgtgt atggttattg tgtgtggagg agatattgtt ttgttgtgat gggggtttgg 2220 

ggtgtttttt tatgttgtag gtaagtgggg tggtggttgt ggtatttgtt tattgggagt 2280 

tttttttttt tttttttgtt gttgttgttt tgtatttagt ttgggggagg atagaagaaa 2340 

aaggaggtag aatggatttt tttggttttt ttttgtgggt gggggtgggt tagggtgggt 2400 

tgtgttgtta ggtagttttg ttgtgttgtt aggtagtttg gttgttggtg tgggtgatgt 2460 

tggtgttggg 2470 

<210> 73 
<211> 2470 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 73 

tttagtgttg gtattgttta tgttggtgat taggttgttt agtaatatgg tagggttgtt 60 

tggtaatgtg gtttattttg gtttgttttt gtttataggg gaaggttaag gggatttatt 120 

ttattttttt ttttttttgt ttttttttga gttggatgta gagtagtagt agtaggagaa 180 

ggaagggaaa gtttttggtg ggtaggtatt gtagttgttg ttttgtttgt ttatggtgtg 240 

aaggagtgtt ttgggttttt attgtggtag ggtagtgttt tttttatgtg tggtagttgt 300 

gtgtgtttag tttttattgt gtattatgtt tgtttggtta gtttttggga gtttgttggt 360

tgggtttttt gggatggtag tggttttgtg agggatttgg gttgtggttg aggatgtaga 420 

ggaggagatt gatagagtat ttggtttatt gggagaagat ttggaaaggt gtgggatggg 480 

ggtaggaagt gggtgtggtg agggagttag gttttggggg tgtggtgtgt gtttttgttt 540 

ggagtttgtt ttttttggat tagtgttgag gagaggagtg gttttttggt tttgggattt 600 

taatttgagg ttattttgtt attgaattta attattgggt agtttagtat tttatttggg 660 

gtttagtata gaaagatttg aatgtttttt tggatgatga aggggaagtt ttggggaatg 720 

ataatttggt ttttgtggtt tttagtaatt tttattttta ttgagattat tattaggaat 780 

atagtttttt ttttttagag ttttgggaaa ttatttaatt ttagagtttt ttgatgtata 840 

gaagttgttt tataatttta taattgtttt ttttagtttt attggaaggt ttgaggtata 900 

atttagattt tattttattt aatgatttat ttagtattta tggagtgtat aagtatagag 960 

tattgtatta gttattttta atataaaaat gaatagattt attttaattt ttattttatg 1020 

aatattttaa gaattatagt agattggata tggtggttta tgtttgtaat tttagtattt 1080 

tgggaggttg aggtaggtgg tttatttgag gttaggagtt taagattagt ttgttgtaat 1140 

atggggaaat tttattttta ttaaaaatat aaaaagtagt tgggtatggt ggagtgtttt 1200 

gtagtttttt tatttgggag gttgaggtag gagaatggta taaatttggg aggtagaggt 1260 

tgtagtgagt tgagattatt ttattgtatt ttagtttggg agatagaatg agattttgtt 1320 

ttaaaaaaaa aaaaaaaaaa aaaaaaagaa ttatagtagt agattgtatg ttgaatgttt 1380 

gatatgggtt tgatttgttt taatgtatta tggtttttat tattttagta tttagagatt 1440 

tttttatttg tgttatgaat atttttggta atgtgataaa atttgtagat gtttttttaa 1500 

aatagtatat atttttaaat gtatgaaatt aaatatatag tattagttag gtatggtggt 1560 

ttatatttgt aaatttaatg ttttgggaga tttatgtagg aggtttgtgt ttaagagttg 1620 

gagagtagtt tgggtaagat agtgagattt tgtttttatg aaaaaaaaaa aaattaatag 1680 

ttgggtatgg tggggagtat ttgtagtttt agttatttta agattgaggt tggaaggatt 1740 

attagagttt aggagtttga ggttatagtg agttttgatt atgttatagt attttagttt 1800 

gggagataga gtaaaatttt gtttttaaaa atatatatag aatataaaaa tattatatat 1860 

ataatattta aggaatatat atgtataatt gtttaagata tttgtaataa tttattgagt 1920 

atgaaaataa tttttgtagg aaataaagtt gtaggtattg ataatattat tgtggtttgt 1980 

tgtttgtttt tataatttaa agaaatgaaa ttgttgtgag gttaatagat ataaagttat 2040 

gattttttta atataaagtt tatagatgtt gaaattttat ggattttaga ttaagaatgt 2100 

ttgttttata ataattttat taattaggtt ttattgttaa tgatttttag ataaagaaat 2160 

taaggtttag aaaaaatttg aatattgata gtttggtttt atagttatgt ttatgttatg 2220 

atgtttttgt taaaggagtt tataaatata tattattttt ttgtattgta agtttttgat 2280 

tttttattat tatgtgaata ttttgtttga attattattt ttgttaaaaa ttttagataa 2340 

aataaatttt atattttatt ttagtaaata ataatttata aatttgtagt attttgaatt 2400 

agaagtgttt agaaagttta gttttagtgg tttgagagta agttttaata gattaaatat 2460 

aaaaataaag 2470 

<210> 74 
<211> 5907 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 74 

gttgttggag tagggtttta gtgtattgtg agtttttgtt ttttttgttt tgtagtgttg 60 

tgagtttttt ttttttttgt gtttggtttt gtttgtgttt tttttggtgt gggtatttgt 120 

agatttgtta ggaggttgat gagaggtgtt tttatagagt tttgtttttt tttagatttt 180 

taaatgaaaa gaaagagaaa agttaatttt ttgtttttat tttgtatgta tttggagagt 240 

ggttggtggt gagtgatttg tataattgtt ttttttgtgg tgtttttgga tgggtggaga 300 

gaatttgggt gattgagttt gaggttgagt ttttttggga gtgatttatg ttagttattt 360

ttatttttaa gggggatttg tggttgggtt tatgtgtgaa ggatgggtgt tagggaaatt 420 

gatttataga tgatttgttt tgtgagagat tgataattta gttttgtggg tttttggggt 480 

gtttagatta gtttttagtt agaggatttg gggttgtata gtttttggat tttggagggg 540 

gtggtttggg ggtgggagat tgtgagtatt agtttatgtt ggtttttttt tattttgtgg 600 

tggtgaggtg ggaagggtta ggatgtgagt ttagtatttt ttttagaaat gtagtatttt 660 

gtttttttat tttttttgtt tttggtgttt ttttttttat gtgttttttt ttgttttttt 720 

tttaagggtg tttttttttt gtttttagtt tatgtttgaa ttttttggtg tttttttttt 780 

ttttttttag tttttttttt atgttttttg tttttgggta tttttttttt ttaatttttt 840 

atttttgtat tgtttgattt tgaggggtgg gagtgtattg ggttgtgtat gggtgggggt 900 



gttgtgttag ttttgtgtag ttgttttgat gttgttgttg ttgttgttgt tgttgttgtt 960 

ttttgtagtt tagtttgtgt tttgtggtag ttttgtagtg tattagttat tattgttgtt 1020 

gttgttgttt tgttagattt gttgttagtt tgtttggttt agttttgaga gagttttgaa 1080 

tgttagttgt gagggttatg agttagagag ttttggggtg ttgtgtggag agtaagtgga 1140 

gatagtgatt ttgtgttttt tagtttttgt ttttttgtat tgtgtttttt gtatttttgg 1200 

gtttttttgt tttttttgtt gtttttattg ttgttatggt tattttgttt tgtagtaagt 1260 

tgtttaatgt ggttatgttt gtgtttaata agttttaggt taagatgagt ggtatgtttg 1320 

ttaggatggg tttttaggtg gttatggatg aggaggtggt gggttttgtg tattgtgatg 1380 

attttgattt tgagtattgt tagggtttgt agatggatat tttgaaagtt gagggagagt 1440 

tttgtgggga tgagggtgtt gaagtgtttg ttgagggaga tatttattat tagtgaggta 1500 

gtggagtttt tttgttgttt tttggtttta aggattaggt gggaggtggt ggtgaatttg 1560 

ggggttatga taagtttaaa attatggtgt gggaggtagg ttggaatgtg attaatgtta 1620 

tttaggtaag tgtgggattt ttagttttgt ttgttttttt tttttttagt ttagtgtgtt 1680 

gggttttgtt tttgatagtt gtttggtgat tttggtttgg agattttttt ttgtatttag 1740 

gaattttttt ttttttattt tttttagttt tgtgtgggga tttatgtttt taggtggtgt 1800 

tttttgtttt aatttatgtt tttttttaat ggtattagtt gtaagattgt taggttgaag 1860 

tttggtttga gatatttgtt tggagatatt gtaaaagtga aggaaatggg gggagggagt 1920 

aggaagtgat gagaaagaaa gaaaattagg attggagggt atggtttggt tttggatttt 1980 

ggaatggatt tatagttgta tttttgggag gaaagaagaa ggggaaattg ttgaggttgg 2040 

agtttttttt ttttgtatat atgtatagat atgtatatgt atataggtag gtatattata 2100 

tgtttatagt tttttttttt atgaataaag gtgtaggtat aggtgtatat gatatgtatt 2160 

tatatataga ttatagtatt atttttaaat gtatttgtgt atatttttat agttttataa 2220 

ataaaagtat atatttgtat atattaaata tatgaatatt tgtttgaata tatgtatata 2280 

tatatgtata gatgtatgta gataggaatt agtttgtttt tttgggtgtg gttttgttgg 2340 

agatatatag gtatatatat atgaaagtat atatatgttt ttatataaat atatttagtt 2400 

tgtaataatt tagtattatt tgtgtataga tttgtttttt ttgtttgaaa ttttggttgt 2460 

attgtaaaga ggtggttagt ttttttttgt gtgattttgg tgaagttttt tttagaggtt 2520

aagaatggtt tttgtttttt aggttttttt tttttatttt taatttttta ttttttatta 2580 

ggtggtgtag gatttttata tagggagtag gtttggagtt ttggtaatga tttttgagtt 2640 

agtaggggtt ggtaggttgt agttttttgg gttgaaggag attttagttt taggtaattt 2700

tggtttttta tagtttggtt ttgtggtgtt tttggagtgt attattattg gggttatgga 2760

aggtaggttt tttgggagta gaggtttttt aggggttgtt ttatgtattg gggtaagtag 2820 

gattttattg ggttttgaat attagatggt ttgatttaag gtgttttttg ttttggggtt 2880

tggtgagatg gttttgttgg attatggatt tggggttgtt tttgtttttg ttgtgttttg 2940 

tgattttttg gggtgttttt taaggttttg tttgataggt tttaagggag gttagggtag 3000 

gaggttggag gtttgtaggt ttttgggatt ttggattttg ttagttttta gtttttgtgt 3060

ggattttaaa ttttatagtt ttgtttgttt gattttgaag tttttggttt tagatttttg 3120 

tttggtttgg ttttttggtt tttttggtgt ttttttgtta agtttttttt tttgtttgtt 3180 

tttttagttt ttatttagtt ttttttttta ggttttgttt ttgttttatt ttaattttag 3240 

ttttgtttta agatttttaa tttttagttt tgtttttttt tttttgtgtg ttgtttttta 3300 

agtgttttta tattgttttt gtttttgttt tttaggggtt ttgttttatt tttagttttt 3360 

tttttttttt attttatttt ttttttgttt tataattgta gtttttaatg tttttttttt 3420 

tggtttgagt ttgagatgtt tgagtttttt tttttagttt ttgtgtttta ttttagtttt 3480 

attataaatt ttgtgtttat tttttttatt ggtttaggtt tagtttatta ttttttttag 3540 

agtttttttt tgattatgtt tttttttgtt tttttttatt ttttttttta attttttatt 3600 

ttgtttttgt ttttagaaat tattttgaag tatttttttt tttttttgtg gttttttttt 3660 

atttttagtt tttttttatt ttagtttttt tgttattatt agttattagt gtattttttt 3720 

tttttagatt tgtttatata tttttttttt tttatttttg gtttgatatt tgggtttatt 3780 

attagttttt ttatatttat tttgtagttt gtttttaatt tttttttttt ggtggtttag 3840 

atttaatttt tagtgttttt agtgtttttt ttgggttata gtgttaagtt atgttttttg 3900 

ggttttttat ttgttgttaa gtttgttgag tgtttgtgtt tggttgtttt tttgttttat 3960 

agggtatgtt tgtgttgggt ttattttatg ttattttgta tggtggttat ttggggttgt 4020 

tttttattat ttttgttgtt gttgtgtgtt gttatattgg taagattttt attgtgtgtt 4080 

tgtatgagga gaatgaagat ggtgaggtgg tgtgtgtgtg ggatttgtat gtggttatag 4140 

ttaatgtttg ttgtgttttg tgttttttaa tgttgggtgg ttgagtggtg aatgtagtgt 4200 

agattattga gttggtgatg atgtgtattt tgtatgtggt ggtgagtggt aattttatgt 4260 

ataatagttt tttggggttg tttgtgttgt agaagttttg gtttattatt gttatggttg 4320 

tgttgttgtt ttgtgttttt tttaagaatt ttaaggttgt gtttaagttt agtttgttgt 4380 

gtattttggt ttattttgtt attaatattt tggttatagt ttattgttta ttgtgggtgt 4440 

gtgattgggt ttgggagaag gttaagtttt atattgatgt taagaagttt tttattttta 4500 

ttggtattat tgtgtttagt tatatgtttt agattttttt gtttttgttg gagggtaata 4560 

tgtagtagtt tagtgagttt tattgtatga tgaattggat gtatattgta gtttgtgtgt 4620 

ttaagggttt ttttgtgttt gttgtttatt ttatttgggt tgatgagatt aaggaggtta 4680 

ttatggataa tttgtttggt tttatttgtg ttgtggttaa tatttttttg gtggttaagg 4740

tgttgttgtt ttattttttg ttattttttg ttgttgttga ggtgttggag aagttgtttt 4800 

tttaggaagg tagttgtgtt ttttttttgg tttgttatag tggtgatggg tgtttgaagt 4860 

tttgggggtt gatgttgtgt tgtgtgtttg ttgtttttat gttgtttatg gttatttatg 4920 

tgttgtattt tgtgttgttt atgggtttta ttggtagttt tatgggtgtt ggtttttgtt 4980 

ttttgttgtt tagttttttt tatttgtgtt tgttttggtg taagttgttg tggtattaag 5040 

ttttttttga tgttgttatt tttgttattg gtggtatttg tagtgtgttt ggttttgtgt 5100 

atttttttga gggttttatt gaagtttatt gaattaatgt ggaggattag ggtgtaaggg 5160 



tgagtttttg ttgtgttttt gtgttttttt tttttttttt ttattttgtt tttattagtt 5220 

tagtgtgttt tgttgttgtg ttgggaggtt aagttttaaa tatttttggt ttttagtttt 5280 

tgattatttg gggatggggg ggatgggagg ggatagggat ttatgattta ttgtgtttgt 5340 

gtttttgttg tttttttttt tttataatat tttggttttg gggggaggtg gggtgtattt 5400 

gtgggtaggg ttttttgttt ttttaagtgg ggttttgata ttttggtttt agttattgag 5460 

ggggttggga agggagggag agggggtgta gtttgtaggt gtggtaattt gattttgggg 5520 

gaatatttta tatttattta gagtttggaa tttatagtgt ttagttattt ttagtaagag 5580 

tgttttttat tttggagatg ttttaatttt gtagtgggaa aggttgattg ggaaatttat 5640 

tttgggtggg taattttttt taatgaagtt ggaaggtgag aagttgtggt ggggttagtt 5700 

tgtttgttgg tttttaggaa tttaaatttt tattttgtgt aatttattag gtgtggaatt 5760 

gttttattgt gtgtgtggtg tgtttgtggt gaataagatg aaatgtatat tagaaaaaaa 5820 

tttattttta atttagagtg tggtatataa ttatatttgt aaataaagaa gagataaagg 5880 

tttgtgtggt ttggtgttgg gtttgtg 5907 

<210> 75 
<211> 5907 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 75 

tataaatttg atattgggtt gtgtaagttt ttgttttttt tttatttgtg gatataatta 60 

tgtattgtat tttaaattag agatagattt ttttttgata tatattttat tttatttatt 120 

atgagtatat tatatgtata gtagaatagt tttatatttg ataaattgta taagatgaga 180 

gtttagattt ttgaaaattg gtaggtaagt tggttttgtt gtggtttttt gttttttggt 240 

tttgttgaag gaaattgttt atttaaaatg gattttttag ttagtttttt ttgttgtagg 300 

gttgaaatgt ttttggaatg ggaagtgttt ttgttggaaa tggttggatg ttgtagattt 360 

tgagttttgg atggatgtga aatatttttt taaggttaag ttgttatgtt tgtgagttgt 420 

gttttttttt tttttttttt taattttttt gatgattgga attaaagtgt tggggtttta 480 

tttggaagga tagagaattt tgtttgtaaa tgtattttgt ttttttttaa aattagggtg 540 

ttgtggaaaa gaaaggataa tagaaatgta gatgtgatgg attgtgaatt tttgtttttt 600 

tttatttttt ttatttttga ataattagaa attaggaatt agagatgttt aaagtttggt 660 

tttttaatgt ggtggtaggg tgtattgggt tggtgggggt ggggtgaggg gagaagggag 720 

agagtgtaga agtgtggtgg gggtttgttt ttgtgtttta gttttttgtg ttggtttggt 780 

aggttttgat gaggtttttg agggagtgta tgaagttgga tatgttgtag atgttgttga 840 

tgatgaagat ggtgatgttg aagaagattt ggtgttatag tagtttgtgt tagagtaggt 900 

gtaggtgaaa gaggttgggt agtaagaaat agaggttggt gtttgtgagg ttgttggtga 960 

ggtttatgag tagtgtgaag tgtggtatat aaatggttat gagtagtgtg aagatgatga 1020 

gtgtgtagtg tagtgttagt ttttaggatt ttaggtgttt gttgttgttg tagtaggttg 1080 

ggaaaaaggt gtggttgttt ttttggaaga gtgatttttt tagtattttg atagtggtaa 1140 

agaatggtag aggataggat aatagtgttt tggttattag aaagatgttg attatggtgt 1200 

ggatggagtt gggtaggtta tttgtgatga tttttttggt tttgttggtt taggtgaggt 1260 

aggtgatgag tgtgaagagg tttttgagta tgtaggttgt gatgtgtgtt tagtttatta 1320 

tgtagtggaa tttgttgggt tgttgtatat tgttttttag tgaaggtagg aagatttgag 1380 

atgtgtagtt gaatatgatg atgttaatgg agatggggaa ttttttgatg ttgatgtaga 1440 

atttgatttt tttttaggtt tagttgtgtg tttgtgatag atagtaggtt atgattagga 1500 

tattgatgat gaagtgggtt agagtgtata gtagattgaa tttggatatg gttttgaggt 1560 

ttttaaggaa ggtgtaaggt agtagtatgg ttgtggtgat aatggattag gatttttgtg 1620 

atatgggtag ttttgggaag ttgttgtata tgaggttgtt atttattatt atgtatagga 1680 

tgtatgttat tattagtttg atgatttgtg ttatgtttat tatttggttg tttagtgttg 1740 

ggaagtgtgg ggtgtagtag gtgttggtta tggttatgta tgagttttgt atgtgtatta 1800 

ttttgttgtt tttatttttt ttgtataggt atgtgatgag gattttgttg gtgtagtagt 1860 

atataatggt ggtgaagatg atgagaaata attttaggta gttgttgtgt aggatggtgt 1920 

agggtaggtt tagtatgaat atgttttgtg gggtggagag gtaattagat gtggatgttt 1980 

agtgaatttg gtaatggatg aggggtttgg ggggtgtggt ttaatgttgt ggtttgaagg 2040 

gggtgttaag gatattgaga attgggtttg agttgttggg gaggagaggt taagagtagg 2100 

ttgtggggtg gatgtagggg aattaatagt gagtttgggt gttgagttag agatggagag 2160 

ggagggatgt atggatggat ttgagaagga aaggtgtgtt aatggttggt gatagtgaga 2220 

gggttagggt gggagagggt tgagaatagg gagaagttat aggagggaag ggagatattt 2280 

tgggatggtt tttggggata ggaatagaat gaggagttgg gggaggagat gagaagggat 2340 

agaagggagt gtagttggaa aggggtttta agaaaggtag tgagttgggt ttgggttagt 2400 

ggaaagagtg ggtatgggat ttgtggtggg gttggaatgg ggtatgggag ttaaagagga 2460 

aaatttgggt gttttaggtt tagattagag gaggaaatgt taaaggttgt agttgtgagg 2520

tgaaaaggaa atagggtaga gagggagagg ggttgaaggt ggggtaaagt ttttgggagg 2580 

taagaataga gataatgtga agatgtttgg ggagtagtgt gtggggagga ggaagtaggg 2640 

ttggaaattg agagttttag gataaggttg gagttagagt gaggtgaggg tgagatttaa 2700 

ggaaggggat taagtggagg ttggggaggt aagtaggagg aggagtttgg tgggaaggtg 2760 

ttagggaggt tagaaagttg aattgagtaa gagtttagaa ttagaagttt tagggttagg 2820 



tggatagggt tgtagggttt agaatttata tggggattaa gagttaatga aatttagggt 2880 

tttggaggtt tatgagtttt tagttttttg ttttggtttt ttttggggtt tattaaatgg 2940 

agttttgaaa gatgttttgg gaagttgtgg ggtatagtga aggtgggagt agttttgggt 3000 

ttgtgattta gtaaggttgt tttgttgggt tttggagtgg aaagtgtttt gggttgggtt 3060 

atttggtgtt tggggtttgg tggagttttg tttattttga tatatggaat aatttttggg 3120 

aggtttttgt ttttagaaaa tttgtttttt gtggttttag tgatagtgtg ttttagggat 3180 

gttgtggggt taagttgtaa gggattgggg ttgtttagag ttgaagtttt ttttggttta 3240 

gaaagttgtg gtttgttagt ttttgttagt ttggaggttg ttgttaagat tttaaatttg 3300

ttttttatgt aggagttttg tgttgtttgg tggggagtgg ggggttagga ataaggggag 3360 

agggtttaga gagtaggggt tatttttagt ttttggagag ggttttgtta gggttatatg 3420 

gggaggagtt ggttgttttt ttgtggtgta gttgggattt tgggtgagag gggtgagttt 3480 

gtgtataggt aatgttggat tattgtggat taaatgtgtt tgtgtgggag tatgtgtgtg 3540 

tttttgtgtg tgtatatttg tatgttttta atagggttgt atttaaggga gtaggttggt 3600 

ttttgtttgt atgtatttgt gtatgtgtgt gtgtatatgt ttaaatgaat atttgtgtgt 3660 

ttggtatgtg taggtatatg tttttgtttg tggagttgta ggggtgtatg taggtgtatt 3720 

tggggatagt attgtggttt atatatagat atatgttgtg tgtgtttgtg tttgtgtttt 3780 

tgtttgtggg aaggggggtt gtgggtatgt ggtgtgtttg tttatatatg tgtgtgtgtt 3840 

tatgtgtgtg tgtgggggga ggagattttg attttagtga tttttttttt tttttttttt 3900 

ttaaaaatgt agttgtgaat ttgttttaga gtttaagatt aaattgtgtt ttttaatttt 3960 

gatttttttt tttttttatt gttttttgtt tttttttttt atttttttta tttttgtagt 4020 

gtttttggat aggtgtttta gattgaattt tagtttagtg gttttgtagt tggtgttgtt 4080 

gggagggagt gtgggttagg gtggaaaata ttgtttgggg gtgtaggttt ttgtgtaggg 4140 

ttgggaggga tggagaaagg gagatttttg ggtataggag ggggttttta ggttgagatt 4200 

attgggtgat tgttgggggt agagtttggt atgttgagtt gggagggggg aggataggta 4260 

gaattgggaa ttttgtgttt atttggatgg tgttggttat gttttagttt gttttttatg 4320 

ttgtgatttt gggtttgttg tggtttttga atttgttatt attttttatt tggtttttgg 4380 

agttggaggg tggtagagga gttttgttgt tttgttgata atggatgttt tttttgatgg 4440 

gtgttttagt gtttttgttt ttgtagggtt tttttttggt ttttaggatg tttatttgta 4500 

ggttttggtg gtgtttaaag ttgaggttgt tgtaatgtgt gaagtttatt gtttttttat 4560 

ttgtggttgt ttgaaaattt attttggtga atatgttgtt tattttggtt tgggatttgt 4620 

tggatatgga tgtggttatg ttggatagtt tgttgtggag taaggtggtt atggtggtgg 4680 

tggggatagt ggaaaggata gaaggatttg aggatgtggg gaatgtgatg taagaaggtg 4740 

agggttgggg ggtgtaaagt tgttattttt gtttgttttt tgtgtggtgt tttggggttt 4800 

tttggtttat gatttttgta gttggtgttt gaggtttttt tagggttgga ttgggtaagt 4860 

tggtagtagg tttggtggag tggtggtggt ggtggtggtg gttagtgtat tgtggagttg 4920 

ttgtggggtg tgagttgggt tgtggagggt ggtggtggtg gtggtggtgg tagtggtgtt 4980 

agagtagtta tgtgaagttg gtgtggtgtt tttatttgtg tgtagtttaa tgtgtttttg 5040 

ttttttggaa ttaggtggtg tgggggtggg ggattggaga gaggaaatat ttggaggtag 5100 

gggatgtgag gaaggggtta ggagaagaga aagggggtgt tgagggattt agatgtgagt 5160 

tgggggtaga agaagggtgt ttttgggaag gaggtgggag ggagtgtgtg aaagagggag 5220

tgttgagggt aggggaggtg ggggggtgaa gtgttgtatt tttaggggag gtgttaagtt 5280 

tgtattttgg ttttttttat tttgttgttg tagggtggga gggggttggt gtgggttgat 5340 

gtttgtagtt ttttattttt ggattatttt ttttagggtt tgggagttgt gtagttttag 5400 

attttttggt taaggattgg tttgggtgtt ttgaaagttt gtaggattga gttgttagtt 5460 

ttttgtggag tgggttgttt ataagttagt ttttttggta tttatttttt gtgtgtaaat 5520 

ttggttgtga gttttttttg aaggtagaag tggttggtat ggattgtttt taaggaagtt 5580 

tagttttagg tttagttatt tgggtttttt ttgtttgttt agggatgttg tgagggaagt 5640 

gattgtgtgg attgtttgtt attagttgtt ttttaaatgt gtgtagaatg gaggtgaaag 5700 

gttggttttt tttttttttt ttgtttgggg gtttggaaga aggtaggatt ttatgaggat 5760 

attttttgtt agttttttag tgggtttgtg aatgtttatg ttgagagggg tgtagatggg 5820 

gttaggtgtg gaggaaagaa gggtttgtgg tgttgtaagg tggaggaagt agaggtttgt 5880 

ggtgtgttgg ggttttgttt tgatggt 5907 

<210> 76 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 76 



acggcgtttt ggttcgt 17 

<210> 77 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



<220> 



<223> chemically treated genomic DNA (Homo sapiens) 



<400> 77 

cgacttaata acgacgtacg cgaaccc 

<210> 78 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 78 

gtcatctaac aacgacctaa ctaacg 

<210> 79 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 79 

ctctaaaccg cgaaaactcc g 

<210> 80 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 80 

cgcgtttgtt cgggtttcgg g 

<210> 81 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 81 

gggatgagag gttgttgacg ttc 

<210> 82 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 82 

agggcgtttt gttggcg 

<210> 83 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 83 

atcggcgttt tagcgtgcgg g 

<210> 84 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 84 

ctacaaccta aacgacgcgc t 

<210> 85 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 85 

gaggtttcgg ttcgaggttc 

<210> 86 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 86 

acgtcacaaa aaaaaaaccc acgtaataat aacgaa 

<210> 87 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 87 

ctttactaca acgaaaacgc cg 

<210> 88 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 88 

gattaaaaaa tttgttttta ttcggtcg 

<210> 89 
<211> 23 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 89 

cgcggaagta gagtaggcgg gcg 

<210> 90 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 90 

tccgcttcga aaacctcga 

<210> 91 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 91 

caaaatccgt cgacgataaa ac 

<210> 92 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 92 

tcgggttggg gttcgtattc gg 

<210> 93 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 93 

acgtagttat aggttttgat agttttaggg a 

<210> 94 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 94 

gaacgcgaaa aaattaacat ctcg 

<210> 95 
<211> 35 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 95 

tttcgttttt atcgtggtgg gtatgttcgt gttta 

<210> 96 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 96 

tttcgggatt gttgttggtg ttt 

<210> 97 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 97 

catcaatatc ctaatcataa cctactatct atcg 

<210> 98 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 98 

taggtttagt cgcgcgttcg cga 

<210> 99 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 99 

gagatgggga attttttgac gtc 

<210> 100 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 100 

cttaaaataa taacgcgaaa tataacgct 

<210> 101 
<211> 27 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 101 

cgaaaaacga tcgaaatccc gactttc 

<210> 102 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 102 

tttgcggaaa gacgtcgtt 

<210> 103 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 103 

ccaaaaatac gcgaccacg 

<210> 104 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 104 

ctaaaccact aatattcgac gtaattctcc gtatttac 

<210> 105 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 105 

cgttttttcg tattcggaaa gac 

<210> 106 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 106 

tacgtgtgag gggttcgc 

<210> 107 



<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 107 

ccaatcgaaa ctatccaaat acgataaccg a 

<210> 108 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 108 

ccaataaaaa ttcgtaaaaa aaaacg 

<210> 109 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 109 

cggttagtcg gaggcgc 

<210> 110 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 110 

cgaacgaaac aacctaaacg aaaaccccg 

<210> 111 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 111 

ccaaaataac caatcgaccc g 

<210> 112 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 112 



gggattaaga ttttcggtta gtttcg 



<210> 113 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 113 

caaacgaatc acataaaaaa cgtaataacc cgaa 

<210> 114 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 114 

aacgctacgc gactaaattc ga 

<210> 115 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 115 

gggttagggt gggtcgc 

<210> 116 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 116 

cgtcggcgtg ggcgatgtc 

<210> 117 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 117 

gactcccgcc ccaacg 

<210> 118 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 118 



cggagggtac ggagattacg 



<210> 119 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 119 

cgaaacccta aatatcccga ataacgccg 

<210> 120 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 120 

cgacgacgcg cgaaa 

<210> 121 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 121 

aaaaaaaaaa aactcctcta catac 

<210> 122 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 122 

ggttattgtt tgggttaata aatg 

<210> 123 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 123 

acatacacca caaataaatt accaaaaaca tcaaccaa 

<210> 124 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 124 



tttttttttt cggacgtcgt t 

<210> 125 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 125 

tcggtcgatg ttttcggtaa 

<210> 126 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 126 

ggtttgggat ggttgt 

<210> 127 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 127 

tctaaataac aaaatacctc catt 

<210> 128 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 128 

ccattaccaa cacaacccac caaccaa 

<210> 129 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 129 

ggtcgcggtg agagagagag 

<210> 130 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 130 



tgaaattaga gcgcgtttcg g 



<210> 131 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 131 

ggtgttgttt attttagaga gtt 

<210> 132 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 132 

cctaacccct atccttc 

<210> 133 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 133 

ccttcaccac cttcccaaca ctacaacac 

<210> 134 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 134 

tggggtttcg gcgaagtaa 

<210> 135 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 135 

tttttcggcg tcggtcgt 



